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Abstract 

We consider the tree-valued Fleming-Viot process, (Xt)t>o, with mutation and 
selection as studied in IIDGP12II . This process models the stochastic evolution of 
the genealogies and (allelic) types under resampling, mutation and selection in the 
population currently alive in the limit of infinitely large populations. Genealogies and 
types are described by (isometry classes of) marked metric measure spaces. The 
long-time limit of the neutral tree-valued Fleming-Viot dynamics is an equilibrium 
given via the marked metric measure space associated with the Kingman coalescent. 

In the present paper we pursue two closely linked goals. First, we show that two 
well-known properties of the Fleming-Viot genealogies at fixed time t arising from 
the properties of the dual, namely the Kingman coalescent, hold for the whole path. 
These properties are related to the geometry of the family tree close to its leaves. In 
particular we consider the number and the size of subfamilies whose individuals are 
not further than e apart in the limit e — > 0. Second, we answer two open questions 
about the sample paths of the tree-valued Fleming-Viot process. We show that for all 
t > almost surely the marked metric measure space X t has no atoms and admits 
a mark function. The latter property means that all individuals in the tree-valued 
Fleming-Viot process can uniquely be assigned a type. All main results are proven 
for the neutral case and then carried over to selective cases via Girsanov's formula 
giving absolute continuity. 
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1 Introduction and background 

A frequently used model for stochastically evolving multitype populations is the 
Fleming-Viot diffusion. In the neutral case the corresponding genealogy at a fixed time 
t is described by the Kingman coalescent which was introduced some 30 years ago as 
the random genealogy relating the individuals of a population of large constant size in 
equilibrium HKin82al|Kin82bll . 

As a genealogical tree with infinitely many leaves, the Kingman coalescent exhibits 
some distinct geometric properties. In particular, S. Evans studied the random tree 
as a metric space, where the distance of two leaves is given by the time to their most 
recent common ancestor BEvaOOII . Using this picture, the Kingman coalescent close to 
its leaves has a nice shape: roughly speaking, in the limit e — > 0, approximately 2/e 
balls of radius 2e are needed to cover the whole tree; see Section 4.2 in D. Aldous' 
review article [Ald99]. Equivalently there are 2/e families whose individuals have a 
common ancestor not further than e in the past. Moreover, these 2/e families have sizes 
of order e. More precisely, the size of a typical family is exponential with parameter 
2/e (see !IAld991 eq (35)]), and the empirical distribution of the family sizes converges 
to this exponential distribution. However, these results have been proved only for the 
genealogy of a population at a fixed time. 

In a series of papers of the authors, in part with A. Winter, HGPW09I |DGP1 1 [ [GPWT21 
IDGP12B the Kingman coalescent was extended to a tree-valued process (X t )t>o, where 
X t gives the genealogy of an evolving population at time t. The resulting process, the 
tree-valued Fleming-Viot process, is connected to the Fleming-Viot measure-valued dif- 
fusion, which describes the evolution of type-frequencies in a large (i.e. infinite) popu- 
lation of constant size. In the simplest case of neutral evolution all individuals have the 
same chance to produce viable offspring, i.e., the frequency of offspring of any subset of 
individuals is a martingale. However, biologically most interesting is the selective case 
where the evolutionary success of an individual depends on its (allelic) type and where 
also mutation (i.e. random changes in types) may occur. This case including mutation 
and selection was studied in HDGP12B . 

Goals: The construction of the tree-valued Fleming-Viot process allows to ask if 
the above mentioned properties of the geometry of the Kingman coalescent trees are 
almost sure path properties of the tree-valued Fleming-Viot process. Furthermore, 
while we gave a construction of the tree-valued Fleming-Viot process under neutral- 
ity in BGPW12II and under mutation and selection in HDGP12II , some questions about 
path behavior remained open. We will carry over some (not all) of the geometric prop- 
erties of the fixed random trees to the evolving paths of trees in Theorems [1]- [4] of this 
work. 

In the next section, we explain in detail how we model genealogical trees. In order 
to formulate open questions let us briefly mention here that we use a marked metric 
measure space (mmm-space), that is, a triple (U,r,[i) where (U, r) is a complete met- 
ric space describing genealogical distances between individuals and p is a probability 
measure on the Borel-cr algebra of U x A, where A is the set of possible (allelic) types. 
In particular, the tree-valued Fleming-Viot process (X t )t>o takes values in the space of 
(continuous) paths in the space of mmm-spaces. 

To state two open questions from earlier work, assume that X t = (U t ,r t ,^t) is the 
state of the tree-valued Fleming-Viot process at time t > 0. First, we ask if the mea- 
sure lit has atoms for some t > 0. To understand what this means, recall that the state 
of the measure-valued Fleming-Viot process is purely atomic for all t > 0, almost surely. 
However, for the tree-valued state an atom in the measure m G M\{U t x A) implies 
that a family of individuals of positive ^ t -mass have zero genealogical distance. As we 
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will see in Theorem [5] this is not possible, and the tree-valued Fleming- Viot process 
is non-atomic for all t > 0, almost surely. Second, we ask if every individual in U t can 
uniquely be assigned a type which is of course the case for the Moran model, but does 
not automatically carry over to the (infinite population) diffusion limit. This is the case 
iff the support of \it is given by {(u,Kt(u)) : u e U t } for a function rc t : Ut —t A. In 
Theorem [6j we will see that this is indeed the case and every individual can be assigned 
a type for all t > 0, almost surely. 

Methods: Since the tree-valued Fleming-Viot process was constructed using a well- 
posed martingale problem, we will frequently use martingale techniques in our proofs. 
This allows to study the sample Laplace-transform for the distance of two points of the 
tree as a semi-martingale. In addition, population models have specific features that 
will also be useful. For example all individuals have unique ancestors even though not 
all individuals have descendants and if an individual has a descendant, she might as 
well have many. This simple structure can be used for finite population models (e.g. the 
Moran model) or the tree-valued Fleming-Viot process, since this infinite model arises 
as a large-population limit from finite Moran models (for the neutral case see HGPW121 
Theorem 2] and for the selective case HDGP121 Theorem 3]) to derive properties of the 
family structure. 

An important point of the proofs is that we can transfer properties from the neutral 
case since for most forms of selection (which are determined by the interacting fitness 
functions, which gives the dependence of the offspring distribution depends on the al- 
lelic type), the resulting process is absolutely continuous to the neutral case (which 
comes with no dependency between allelic type and offspring distribution) via a Gir- 
sanov transform. 

Outline: The paper is organized as follows: In Section [2] we recall the definition 
of the state space of the tree-valued Fleming-Viot process, its construction by a well- 
posed martingale problem and some of its properties. In Section [3] we give our main 
results. Theorem [T] states that the law of large numbers for the number of ancestors 
of Kingman's coalescent holds along the whole path of the tree-valued Fleming-Viot 
process. Moreover, we discover a Brownian motion within the tree-valued Fleming-Viot 
process based on the fluctuations of the number of ancestors; see Theorem [2j Another 
law of large numbers is obtained for a statistic concerning the family sizes and we make 
a big step towards this result in Theorem [3j Another Brownian Motion is discovered 
within the tree-valued Fleming-Viot process based on family sizes in Theorem |4j Finally 
we show the non-atomicity along the path in Theorem [5] and obtain existence of a mark 
function in Theorem [6) 

In Section|4]we prove Theorem[T]and after some preparatory moment computations 
in Section [5j we give in the subsequent sections the remaining proofs of the main re- 
sults. We note that various proofs have been carried out using Mathematica and can be 
reproduced by the reader via the accompanying Mathematica-file. 

2 The tree-valued Fleming-Viot process 

In this section, we recall the tree-valued Fleming-Viot process given as the unique 
solution of a martingale problem on the space of marked metric measure spaces. The 
material presented here is a condensed version of results from BGPW09UDGP11IIGPW12H 
and HDGP12B . We only recall notions needed to follow our arguments in the present pa- 
per. Let us fix some notation first. 

Notation 2.1. 

For a Polish space E the set of all bounded measurable functions is denoted by 13(E), 
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its subset containing the bounded and continuous functions byCb(E), the set of cadlag 
function I C E — > E by T>e{I) (which is equipped with the Skorohod topology) and 
the subset of continuous functions by Ce(I)- The set of probability measures on (the 
Borel a-algebra of) E is denoted by M\{E) and =>■ denotes either weak convergence of 
probability measures or convergence in distribution of random variables. If<f>:E—>E' 
for some Polish space E' then the image measure of /j, G M\{E) under <fi is denoted by 
For functions A n- a\ and A i-» b\, we write a\ < b\ if there is C > such that 

cc\ < Cb\ uniformly for all A. Furthermore for Ao € M U {±00} we write a\ A « A ° 6a if a a 
and b\ are asymptotically equivalent as A — s- A 0/ i.e. ifa\/b\ — > 1 as A — >• A . For product 

spaces E\ x f? 2 x . . . we denote the projection operators by -kex , ^e 2 7 When there is 

no chance of ambiguity we use the shorter notation m, 7r 2 , 

2.1 The state space: genealogies as marked metric measure spaces 

Throughout, we fix a compact metric space A which we refer to as the (allelic) type 
space. An A-marked (ultra-)metric measure space, abbreviated as A-mmm space or just 
mmm-space in the following, is a triple (U, r, u), where (U, r) is an ultra-metric space and 
jit e A4i(U X A) is a probability measure on U x A. 

The state space of the tree-valued Fleming-Viot process is 

U.4 := {(U, r, fx) : (U,r,u.) is A-mmm space}, (2.1) 

where (U,r,/J,) is the eguivalence class of the A-mmm space (U,r,fi), and two mmm- 
spaces (Ui,ri,(ii) and (U2, ?*2, M2) are called equivalent if there exists an isometry (here 
supp of a measure denotes its support) 

ip : supp (71-^*^1) supp(7T(7 2>t /i2) (2.2) 

with (ip,id)*fii = fi2- The subspace of compact mmm-spaces 

V A ,c ■= {(U,r 7 fi) G tU : (t/, r) compact} C U A (2.3) 

will play an important role. 

Remark 2.2 (Interpretation of equivalent marked metric measure spaces). 

1. In our presentation, only a/tra -metric spaces (U, r) will appear. The reason is that we 
only consider stochastic processes whose state at time t describes the genealogy of the 
population alive at time t, which makes r an ultra-metric. 

2. There are several reasons why we consider equivalence classes of marked metric 
spaces instead of the marked metric spaces themselves. The most important is that we 
view a genealogical tree as a metric space on its set of leaves. Since in population ge- 
netic models the individuals are regarded as exchangeable (at least among individuals 
carrying the same allelic type), reordering of leaves does not change (in this view) the 
tree. 

In order to construct a stochastic process with cadlag paths and state space I] a, 
we have to introduce a topology. To this end, we need to introduce test functions with 
domain Ua- 

Definition 2.3 (Polynomials). 

We set rW := {r := (ry)i<i< 3 - : r t] G R}. A function $ : V A -> K is a polynomial, if there 

is a measurable function <fi : ftW xi"->M depending only on finitely many coordinates 
such that 

:= (^ N ,^) := f t i® 1 \du 1 da)^(r(u l ,u 3 ) 1 < l<3 ,(a l ) l > 1 ), ^ 
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where is the infinite product measure, i.e. the law of a sequence sampled indepen- 
dently with sampling measure \i. 

Let us remark that functions of the form J2.4H are actually monomials. However, 
products and sums of such monomials are again monomials, and hence we may in fact 
speak of polynomials; cf. the example below. 

Remark 2.4 (Interpretation of polynomials). 

Assume that only depends on the first (™) coordinates in r(ui, itj)i<i<j and the first 
n in (a;);>i. Then, we view a function of the form 02.41 1 as taking a sample of size n 
according to [i from the population, observing the value under of this sample and 
then taking the ^-sample mean over the population. 

Example 2.5 (Some functions of the form I I2.4D ). 

Some functions of the form H2.4D will appear frequently in this paper, for example r H> 

4>{g) :=^a 2 (z) : = e _Ari2 , 

*i 2 ((LW0) = (» m ,i>l 2 ) = / (7r 1 ^) m (du 1 ,du 2 )e- Xr ^\ (2.5) 

This function arises from sampling two leaves, u\ and u 2 , from the genealogy (U, r) 
according to 7Ti*/i and averaging over the test function e~ Xr ( Ul ' U2 ' of this sample. Then 
(\j/ 12 ) 2 i s again of the form ( 12.41 1 and 

(V{ 2 ) 2 (W^j) = J{{ni)^{du 1 ,... 1 duA)e- x ^'^^'^\ (2.6) 
Another function that will be used and which also depends on types is given by 

$i 2 ((t/,r,//)) := J fi m (du 1 ,du2,da 1 ,da 2 )l {ai = a2} e- Xr( - u ^. (2.7) 

In this function u\ and u 2 contribute to the integral only if their types, a\ and a 2 agree. 

Since we use polynomials as the domain of the generator for the tree-valued Fleming- 
Viot process, we need to restrict this class to smooth functions. 

Definition 2.6 (Smooth polynomials). 

We denote by 

II 1 := as in ([Z4J : bounded, measurable and for all a e A N , (/>(-, a) e C\ (rO) | 

(2.8) 

the set of smooth (in the first coordinate) polynomials. Furthermore we denote by 
the subset ofU 1 consisting of all ^ for which (j)(r,a) depends at most on the first (£) 
coordinates ofr and the first n of a and hence have degree at most n. 

Definition 2.7 (Marked Gromov-weak topology). 

The marked Gromov-weak topology on Ua is the coarsest topology such that all $^ e n 1 
with (in both variables) continuous 4> are continuous. 

The following is from HDGP111 Theorems 2 and 5] 

Proposition 2.8 (Some topological facts about U). 
The following properties hold: 

1 . The space Ua equipped with the marked Gromov-weak topology is Polish. 

2. The set H 1 is a convergence determining algebra of functions, i.e. for random 
UA-valued variables X, X\ , X 2 , ■ . ■ we have 

X n ^^X iff E[$(X„)] E[$(X)] for all $ e n 1 . (2.9) 
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2.2 Construction of the tree-valued FV-process 

The tree-valued Fleming-Viot process will be denned via a well-posed martingale 
problem. Let us briefly recall the concept of a martingale problem. 

Definition 2.9 (Martingale problem). 

Let E be a Polish space, P £ M\{E), T C 13(E) and f2 a linear operator on B(E) with 
domain T . The law P of an E-valued stochastic process X = (X t )t>o is a solution of 
the (P , Q, T) -martingale problem if X has distribution P , X has paths in the space 
T>e([0, oo)), almost surely, and for all F £ T , 

F(X t )- f nF(X s )ds) (2.10) 

is a P-martingale with respect to the canonical filtration. The (P ,fi,.F) -martingale 
problem is said to be well-posed if there is a unigue solution P. 

Let us first specify the generator of the tree-valued Fleming-Viot process. It is a 
linear operator with domain IT 1 , given by 

n := o grow + n rcs + n ttlut + n scl . (2.1 1) 

Here, for $ = $^ £ n*, the linear operators fts row , fT cs , ft mut , f2 scl are defined as follows: 

1 . We define the growth operator by 

^ row $((7J^)) : = (^N (V^,2», (2.12) 

with 

(V £ ^,2) :=2 E ^fc^)' (2 - 13) 

i<j<j lJ 

2. We define the resampling operator by 

1 « 

f] res $((CT^)) ^(^ N ,0o0 M -</>), (2.14) 



2 

k,e=i 



with 6> fc ^(r,a) = (£,a), where 



r 



.r iAk ,ivk, if J = ^, and a { := < (2.15) 
3 I dfc, « = t. 
rjAkjvk, ifi = l, 

3. For the mutation operator, let # > and /?(-, •) be a Markov transition kernel from 
A to the Borel sets of A and set 

n 

rr ut $((T7^)) :=tf.]T(^ N ,i? fe 0), (2.16) 
fc=i 

where 

GM)(r,a)- f <j>(r,!&)p{a k ,db), (2.17) 

a£ := (ai,...,a fe _i,6,a fe+ i,...). 
We say that mutation has a parent-independent component if /?(-,•) is of the form 

fi{u, dv) = z/3{dv) + (1 - z)/3(u, dv) (2.18) 

for some z £ (0, 1], f3 £ Mi(A) and a probability transition kernel j3 on A. 
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4. For selection, consider the fitness function 

x ' : A x A x R+ -> [0,1] (2.19) 

with x'(a, 6, r) = x'(6, a,r) for all a, & e A, r e R+. We require that x' G C°' 01 (A x 
A x R+), i.e. x' is continuous and continuously differentiable with respect to its 
third coordinate. Then, with a > (the selection intensity) and 

Xk,i(L,a) = x'( a k,ai,r kMtkw i) (2.20) 

we set 

n 

fe=l 

If x'(a, 6, r) does not depend on r, and if there is x '■ A — >• [0, 1] such that 

X'(a,&,r)= x (a)+ X (6), (2.22) 
we say that selection is additive and conclude that with 

Xfcfca) = xK), (2.23) 

we have 

n 

tt scl 4>((!w0) :=«-^(^ N ,0-X fc -0-Xn + i>. (2.24) 
fe=i 

Remark 2.10 (Interpretation of generator terms). 

The growth, resampling, mutation and selection generator terms are interpreted as 
follows: 

1. Growth: When time passes, the genealogical distance of any pair of individuals 
gets larger. Precisely, the time to the most recent common ancestor (MRCA) for 
any pair grows at speed 2 because every single branch grows at speed 1 . 

2. Resampling: The term (/^® N ,0 ° ®k,i — 4>) describes the action of an event where 
an offspring of individual k replaces individual i in the sample corresponding to 
the polynomial This term is analogous to the measure-valued case (see e.g. 
IIEK931 eq. (3.21)]), but acts on both, the genealogy and the types. 

3. Mutation: It is important to note that mutation only affects types, but not ge- 
nealogical distances. Hence, the mutation operator agrees with the measure- 
valued case (see e.g. BEK93I eq. (3.16)]). Note that here we consider only jump 
operators B. 

4. Selection: This term is best understood when considering a finite population. Con- 
sider for simplicity the case of additive selection (i.e. H2.22H holds) in particular 
covering haploid models. Here, the offspring of an individual of type a replaces 
some randomly chosen individual at rate ax(a) due to selection. In the large pop- 
ulation limit, we only consider a sample of n individuals and this sample changes 
only if some offspring of an individual outside the sample, e.g. the (n + l)st indi- 
vidual by exchangeability, replaces an individual within the sample, the fcth say, 
due to selection. After this selection event, the fitness of the fcth individuals is 
x(a) which is also seen from the generator term. In the case of selection acting on 
diploids, the situation is similar, but one has to build diploids from haploids first 
and then apply the fitness function. 
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In BGPW121 IDGP12B the tree-valued Fleming-Viot processes were constructed via 
well-posed martingale problems. The following proposition summarizes Theorems 1, 2 
and 4 from HDGP12H . 

Proposition 2.11 (Tree-valued Fleming-Viot process). 

For P g Mi(U) the (P , CI, U 1 ) -martingale problem is well-posed. Its solution X = 
(X t )t>o defines a Feller semigroup, i.e. X i-> E[f(X t )\Xo] is continuous for all / G Cb(U), 
and hence, X is a strong Markov process. 

Furthermore, the process X satisfies the following properties: 

1. P(t H> X t is continuous) = 1. 

2. P(X t G V A ,c for all t > 0) = 1. 

3. Let $ = $^ G such that is symmetric under permutations. Then, the 
guadratic variation of the semi-martingale §(X) := (§(X t ))t>o is given by 

MX)] t = (p s ,{ Ps - (^ s ,p s )))ds, (2.25) 
Jo 

Ps{u\) ■■= j pf N (d(u 2 ,u 3: . . .))0((r s (ui,%))i<i<j). (2.26) 

4. Let P a he the distribution of X with selection intensity a. Then, for all a, a' > 0, 
the laws P a and P a i are absolutely continuous with respect to each other. 

5. If either (i) a = and the process with generator fl mut has a unigue eguilibrium or 
(ii) a > and mutation has a parent-independent component, then the process X 
is ergodic. That is, there is an \JA, c -valued random variable X^, depending on the 
model parameters but not the initial law, such that X t > X^. 

Definition 2.12 (Tree-valued Fleming-Viot process and marked Kingman measure tree). 
Using the same notation as in Proposition ^. 11\ we call the process X the tree-valued 
Fleming-Viot process and in the case a = its ergodic limit 

Xqc — (Uoc ) ^oo j poo ) (2.27) 

is called Kingman marked measure tree. 
Remark 2.13 (The Kingman measure tree). 

The random variable X^ arises from the marked ultrametric measure space which is 
associated with the partition-valued entrance law of the Kingman coalescent BGPW09II . 

Example 2.14 (The quadratic variation of (^ 2 (X t )) t > ). 

In some of the proofs, we will need to compute the quadratic variation of &(X) := 
(<fr(X t ))t>o for specific <t> G n 1 via H2.25D explicitly. For as in Example 12. 51 we have 
by fl2??5l ) 

\*\ 2 {X)]t = f\^ 23 (X s ) - * 12M (X S )) ds, (2.28) 
Jo 

with (cf. Definition |5.6D 

Vf kl ((U,r, n)) = ( M ®N )e -A(r(« i ,«,)+r(« fc> u,))) ) i, j, k,l = 1,2, .. . (2.29) 
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3 Results 



Our main goal is to establish almost sure properties of the paths of the tree-valued 
Fleming-Viot process, beyond continuity of paths and the property that the states are 
compact marked metric measure space for every t > 0, almost surely. We start by 
studying the geometry of the marked metric measure tree at time t of the tree-valued 
Fleming-Viot process. First we recall in Section 13.11 some well-known facts concern- 
ing the geometry of the Kingman coalescent and then extend them in Section 13.21 to 
the tree-valued Fleming-Viot process. In Section [3.31 we take advantage of our results 
and techniques and state some further path properties of the tree-valued Fleming-Viot 
process answering two open questions. 

3.1 Geometric properties of the Kingman coalescent near the leaves 

We focus on the equilibrium marked metric measure introduced in Proposi- 
tion 12.111 5, but for most assertions in this subsection we can ignore the marks (i.e. 
think of A consisting of only one element). Since the introduction of the partition- 
valued Kingman coalescent in HKin82a 1, this random tree has been studied extensively 
for instance in jAld99] and HEvaOOIl . In our present formalism (using metric measure 
spaces), appeared first in BGPW09II . In this section, we mostly reformulate known 
results, but also add a new one in Proposition [376] 

The Kingman measure tree, X^, arises from the partition-valued Kingman coales- 
cent, but can also be realized as a discrete graph tree using the following construction. 
Let S2, S3, . . . be independent exponentially distributed random variables with param- 
eter 1. Starting with two lines from the root the tree stays with these two lines for 
time S2 / (2) • At time S2 one of the two lines chosen at random splits in two, such that 
three lines are present. In general after the jump from k — 1 to k lines the tree stays 
with that k lines for a period of time Sk/ (2) an d then one of the k lines chosen at 
random splits, such that there are k + 1 lines. The total tree height is thus Xi, where 

T n '■= S' n+ i/(™2 1 ) + SVi+2 / ("2 2 ) "I ' i- e - T n is the time it takes the coalescent to go from 

n to infinitely many lines. The time of the root is called the time of the most recent com- 
mon ancestor (MRCA) and T is the present time of the population. Before time T, we 
consider the uniform distribution on the branches and construct a tree-indexed Markov 
process, by using a collection of independent mutation processes as follows. Start with 
an equilibrium value of the mutation processes at the root up to the next splitting time 
where we continue with two independent mutation processes both starting from the 
type in the vertex, etc. Running from the root to the leaves and letting time approach T 
we finally obtain . 

At time e (counted from the top of the tree, for e < T\), a random number N e of lines 
are present. Equivalently N e is the minimal number of 2e-balls needed to cover (the 
leaves of) the random tree X^. It is a well-known fact using de Finetti's Theorem that 
the frequency of the family descending from every of the N e lines can be defined for 
all e > 0. In addition, these frequencies are distributed as the spacings between N E on 
[0, 1] uniformly distributed random variables | |Pyk65p . 

From these considerations several results on the geometry of near the leaves 
can be derived. We briefly recall and extend some of them and reprove them later in 
our setting. Roughly we will show that there are 2/s ± C(l/v / e : )-many families in which 
the genealogical distance between the individuals is at most 2e. Furthermore, each of 
the families has mass of order e, as e — >• 0. More precisely, the distribution of (by e 
rescaled) family sizes is exponential with rate 2. 

We split the above picture in two parts. First we study the number of families and 
then their size in both cases looking at a LLN and then at a CLT We begin with a law 
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of large numbers and a central limit theorem for N e (see HAld991 eq. (35)]). Our proofs 
are given in Sections 14.21 and 14.31 

Proposition 3.1 (LLN for the number of balls to cover X^,). 

Let Xr, 



(U 



OO ) ' OO 5 



fj,oo) be the Kingman measure tree. Moreover, let N e be the (mini- 



mal) number of2e-balls needed to cover (U a 



Then 



-> 2, almost surely. 



Proposition 3.2 (CLT for the number of balls to cover X^). 
With the same notation as in Proposition ^. 1\ and Z ~ N(0, 1), 



N e - 2/e 

VWW) 



z 



(3.1) 



(3.2) 



and 



E 



r 



2/e 



> 1 

-> 1. 



E 



(N e - 2/e 



v/27(3i) 



^> 1. 



(3.3) 



We now come to the family structure of X^ = (!/«,, r m , /ioo) close to the leaves. For 
e > 0, we define B 2e (l), ■ • ■ ,B2 E (N E ) C {/oo as the disjoint balls of radius 2e that cover 
E/oo and the corresponding frequencies by 

F i (2e):=/i 00 (B 2£ (i)), i = l,...,JV e . (3.4) 

This can be viewed as the frequency vector of a sequence of exchangeable random 
variables and we can ask for the law of the empirical distribution of the scaled masses 
in the limit e — >• 0, where the underlying sequence, even if scaled, becomes i.i.d. and we 
should get the scaled law of a single scaled i^. It turns out, a first step (cf. Remark [3 .51 1 
is to see that the following law of large numbers holds, the proof of which (together 
with the proof of Lemma \3A\ appears in Section [6j 

Proposition 3.3 (Asymptotics of ball masses near the leaves). 

For Fi(2e) as above, 

i N '- 

TW 2 ^1 (3.5) 

£ — ' 

i=l 

almost surely. 



The classical proof of Proposition 13.31 uses the fact that the random vector 
(Fi(2e), . . . , Fjv t (2e)) has the same distribution as the vector of spacings between N e 
random variables uniformly distributed on [0, 1]. This vector in turn has the same dis- 
tribution as (ii/Ei^i **)>•• • > Y N e /(Y,?=i Y i))' where Y 1 ,Y 2 ,... are i.i.d. Exp(l) random 
variables. Then, using a moment computation, ( 13.51 1 can be proved. For details we refer 
to HEvaOOl Section 2]). We will use a different route for which we need the following 
auxiliary Tauberian result. 

Lemma 3.4 (Reformulation). 

The assertions 

i N * 

-V^&f^l, a.s. (3.6) 

£ '-^ 

i=l 

(A+l)^ 2 ^)^^! a.s. (3.7) 
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with ^ 2 from Examvle \2.5\ are equivalent. Moreover, the equivalence remains true if 
we replace e by e n 10, \ by A„ f 00 with e n X„ = 1 and let n — > 00. 



Remark 3.5 (Refinements of Proposition[ 
Actually, [Ald99] contains refinements of Proposition [33] 

1 . In equation (35) of BAld99ll it is claimed that (correcting a typo in Aldous' equation) 

j 311 ? I2 hn&eXex} ~ (1 - fi 23 



0<a;<oo 



^0 a.s. (3.8) 



This means that the Kingman coalescent at distance e from the tree top consists 
of approximately 2/e families, and the size of each family has an exponentially 
distributed size with expectation e/2 or the empirical measure of the family sizes 
converges weakly to the exponential distribution with mean 2, denoted by Exp(2). 

In order to show this assertion from moments of (F i (2e)) i= i i ... i jv e , it is necessary 
and sufficient that for k = 1,2,... 

1 Ne 

— J2( F >( 2£ ^ k ^>2- (fe - 1 )fc! a.s. (3.9) 
£ i=i 

The sufficiency follows since the moment problem for the exponential distribution 
is well posed, while for the necessity we assume that d3 . 8D holds, and then one 
concludes 

-^2Fi(2e) 2 = -Y] / xdx=-y2 ^{6x<Fi{2e)}xdx 

e ,=i £ l= i J » E .=i Jo (3.10) 



e->0 



Axe 2x dx 



as well as, for k > 2, 



— Y^F^ef = -j^Y, x k - 1 dx = keJ2 Me^F^e)}^ 1 dx 

i=i Jo i=i Jo 

2fca; fe - 1 e- 2:E da; = 2- {k -^k\. 



(3.11) 

2. The statement I I3.8D raises the issue to determine the fluctuations in that LLN, i.e. 
to derive a CLT Here, HAld99[ (36)] states that 

- (| E - a- e "")) x>0 ^ + tb(i - e-^w ( 3 - 12 ) 

j— 1 

where (-8° )o<t<i is a Brownian bridge. Another, for us more suitable formulation is 
to consider the sum multiplied by iV" 1 instead of e/2, so that Z disappears on the 
right hand side. In this case one would consider the fluctuations of the empirical 
measure of masses of the £>(2e)-balls that cover the Kingman coalescent tree. 

We have so far investigated the behavior on top of the tree close to the leaves looking 
at the family sizes with respect to fixed degree 2e of kinship for e — > 0. This picture can 
be refined by obtaining fluctuation results in d 3 . 5 D (or H3.7I Q. We obtain a partial result 
by considering a degree of kinship e/t for t varying in K + and letting e — >• 0. This gives a 
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profile of the family sizes of varying degrees of kinship and their correlation structure 
close to the leaves, if we view the scaling limit as a function of t > 0. This profile should 
be the deterministic flow of distributions {Exp(2/i) : t > 0} which are the limits of 

1 ( Ns/t \ 



N e/t 



i=l 



Again we consider the Laplace transform given through and obtain the following 
fluctuation result - proved in Section [6741 

Proposition 3.6 (Fluctuations of scaled small masses in small balls). 

Let be the Kingman measure tree. Define the process Z x := (Z x ) t >o by 

Z t A :=VA((Af + l)*33(*oo)-l)- (3-14) 

Then every sequence (Z Xn ) n > with A„ — > oo has a convergent subsequence (A^)„>o 
with 

Z V - Z, (3.15) 

for some process Z := (Z t )t>o with continuous paths. Furthermore all limit points 
satisfy 

E[Z t ] = 0, 



Var[Z ; 



Cov{Z S: Z t ) = ** (3.16) 
(s + i) d 



E[Z. f 3 ] = 0, 
3 
it 



mt\ = 772- 



Remark 3.7 (Is Z Gaussian?). 

We conjecture that there is a unique limit process Z in Proposition 13.61 Moreover, we 
note that V[Z t } and E[Zf] are in the relation if Z t ~ N(0, l/2£), which raises the question 
whether Z is a Gaussian process. 



3.2 Path properties: the tree-valued Fleming-Viot process near the leaves 

Although the Kingman measure tree, X^, only arises as the long-time limit of the 
tree-valued Fleming-Viot process, X = (X t ) t >o, near the leaves the geometry of X t (for 
t > 0) and of X^ is similar. The reason is that the structure near the leaves of or 
X t only depends on resampling events in the (very) recent past. Hence, we expect that 
the following results hold. 

Conjecture 3.8. 

The properties of X^ from Propositions 13.11 and 13.31 hold along the paths of X. Fur- 
thermore we conjecture that the more ambitious refinements described in Remark l3.5l 
(see I I3.9D ) also hold along the paths. 

We cannot prove this conjecture completely. However some (positive) answers are 
given in Theorems[T](on the result corresponding to Proposition |3.1H and[3](on the result 
corresponding to Proposition I3.3D . In addition, Theorems [2] and [4] give two results on 
convergence to a Brownian motion along the tree-valued Fleming-Viot process. 

The following theorem is proved in Section [4741 
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Theorem 1 (Uniform convergence of eN e along paths). 

Let X = (X t )t>o with X t = (C/ t ,r t ,^ t ) be the tree-valued Fleming-Viot process (started 
in some X € UJ. Moreover, let iV' be the number of2e-balls needed to cover (U t ,r t ). 
Then, 

PflimeiV* = 2 for allt > 0) = 1. (3.17) 



While the fluctuations in Proposition 13 . 2 1 are dealing with a fixed-time genealogy, we 
can view the fluctuations of the path (eN*) t >o arising in Theorem [T] in the limit e — > 0. 
This program is now carried out along the tree-valued Fleming-Viot process. 

In order to obtain a meaningful limit object, we consider time integrals. It is impor- 
tant to understand that the part of the time-t tree X t which is at most e apart from the 
treetop is independent of T a '■= cr(X r : < r < s) as long as s < t — e. The following is 
proved in Section [4751 

Theorem 2 (A Brownian motion in the tree-valued Fleming-Viot process). 



Let X = (X t )t>o with X t = (U t ,r t ,^ t ) be the neutral tree-valued Fleming-Viot process 
(i.e. a = 0) started in equilibrium, X = Xoo, and B s = (B e (t)) t >o given by 

B e (t) := ^| jf (Ar e s - E[jV e °°])ds. (3.18) 



Then, 



B e ==> B, (3.19) 



where B = (B t ) t >o is a Brownian motion started in B a = 0. 
Remark 3.9 (Expectation of N^°). 

In I I3.18D one would rather like to replace EfiV,? ] by 2/e to measure the fluctuations 
around the limit profile. However this changes the limit which we are not able to 
identify. Precisely, we will see that B e := (B e (t))t>o, given by 

B e (t) := */f / (X° - -)ds, (3.20) 



2 7o V £ e 

converges as e — >• to a Brownian motion B with drift, but unfortunately we cannot pin 
down the latter. Indeed from Proposition |3.2| in particular using boundedness of second 
moments, we see that, approximately, E[i\T^°] « | in the sense that e ■ E[7V^°] £ ^°> 2. 
However, this only implies EfTV,? ] = 2/e + o(l/e) and the error term can be large. In 
order to sharpen this expansion to EfiV,? ] = 2/e + 0(1), we use results from IITav84ll . 
His Section 5.4 (with 6 = and i = oo) yields 



e[jvt • • • W - 3 + 1)] = E p^ 2k - 1 



(k-l)---(k-j + l)-k---(k + j-2) 



(i-D! 

(3.21) 

with pfe(e) = cxp(— k(k — l)e/2). From this, writing 8 := \fe we also see that 

E^ri = f E cx P (-x(x - 8)/2)(x - 8/2)8 

= i E exp(-x 2 /2)(l + x8/2 + 0(8 2 )){x - 8/2)8 

xesn (3.22) 

= 4/ xe~* 2 ' 2 dx+-l (x 2 -l)e- x2 / 2 dx + 0(l) 
o Jo 6 Jo 

= §+0(1) 
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as e — > 0. This, together with Theorem [2j implies that B is of the form (13.20b . 

Now we come to a generalization of Proposition 13.31 to the tree-valued Fleming- 
Viot process. Together with Lemma 13741 we obtain the following result on the Laplace 
transform of two randomly sampled points. The proof is based on martingale arguments 
which will also be useful in the proof of Theorem^ Theorem[3]is proved in Section [674l 

Theorem 3 (Small ball probabilities). 

Let X = (X t )t>o with X t = (U t ,r t ,n t ) be the tree-valued Fleming-Viot process, started 
in some X e U, and let ty 12 he as in Remark \2.5\ Then 

lim Pf sup \(\+l)V{ 2 (X t ) -1\ >e) =0 for all T < oo,e > 0. (3.23) 



Remark 3.10 (Convergence in probability versus almost sure convergence). 

Denote by F*(2e), . . . , F l Nt (2e) the sizes of the X\ balls of radius 2e needed to cover 

(U t ,r t ). If we could show that P(lim A ^ oc (A + l)t>\ 2 (X t ) 1 for alU > 0) = 1, we 

could use Lemma 13741 in order to see that 



^2(F*(2s)) 2 ^> 1 for all t > Oj = 1. (3.24) 

£ i=l 

However, our proof of Theorem [3] is based on a computation involving the evolution 
of fourth moments of ty 12 in order to show tightness of {((A + 12 (X t )) t > £ : A > 0}. 
Based on these computations, we can only claim convergence in probability rather than 
almost sure convergence. 

Remark 3.11 (Possible refinement of TheoremfJ}. 

As an ultimate goal one would want to prove that (compare with 1 13.81 1) 



7V< 

I £ 



5 1 X™j 3U1 ? | 2 ^ 1 {F!(2s)<ex} - (1 - e 2X ) 



0<t<T 0<x<oo 



% a.s. (3.25) 



This would mean that the assertion that roughly the tree consists of 2/e families of 
mean e/2 exponentially distributed sizes holds at all times. Using our conclusions from 
Remark 13.51 this goal can be achieved if we show that H3.9I I holds for k = 1,2,... 
uniformly at all times. (While the case k = 1 is trivial, note that a combination of 
Theorem[3]and Lemma lTOI gives 1 13.91 1 for k = 2.) In principle, the technigue of our proof 
of Proposition 13.31 can be extended in order to obtain 1 13.91 1 for a given but arbitrary k 
which would require controlling higher order moments of <I'^ 2 . If we could do this for 
general k then we would obtain a proof of d3.8t . But since we are using Mathematica 
for these calculations the problem remains open. 

Again, we can formulate a result on fluctuations. Integrating over time (to get a 
process rather than white noise) the guantity (A + l)^ 12 (X t ) — 1, which appears in 
Theorem [3] and using the right scaling, we again obtain a Brownian motion as the weak 
limit. The following result is proved in Section [6751 

Theorem 4 (Another Brownian motion in the tree-valued Fleming-Viot process). 

Let X = (X t )t>o with X t = (Ut,r t ,/j, t ) be the neutral tree-valued Fleming-Viot process 

(i.e. a = 0) started in equilibrium, i.e., X a = X^ and let W\ = (W\(t)) t >o be given by 

W x (t) := A / ((A + l)^l 2 (X s ) - l)da, (3.26) 



with ty] 2 as in Example \2.5\ Then, 



o 



W x W, (3.27) 
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where W = (W t )t>o is a Brownian motion started in Wo = 0. 
Remark 3.12 (A heuristic argument). 

Assume that A is large. Then, (A + l)^fjf(X s ) — 1 depends approximately only on re- 
sampling events which happened within an interval [s — C/A, s] for some large C. In 
particular, on different time intervals (which are at least of order 1/A apart), the incre- 
ments of W\ are approximately independent. Thus, it is reasonable to expect that the 
limiting process is a local martingale. In fact, using some stochastic calculus we can 
show that the limiting process is continuous (i.e. the family {W\ : A > 0} is tight in the 
space Cr([0,oo))) and the limiting object of (Wf (i) — t) t >o is a local martingale as well. 
By Levy's characterization of Brownian motion, W must be a Brownian motion. 

3.3 Path properties: non-atomicity and mark functions 

The calculus developed for the statements in Section 13.21 allows to obtain two fur- 
ther properties of the states of the tree-valued Fleming-Viot process X = (X t )t>o, 
X t = (Ut,r t ,[i t ), namely that the states are atom-free and admit a mark function. More 
precisely, Theorem [5] says that at no time it is possible to sample two individuals with 
distribution \x t with distance zero; cf. Remark 13.131 below. Furthermore Theorem [6] 
says that we can assign marks to all individuals in the sense that [i t has the form 
/i t (du,da) = ('KUt)*[ J 't{du)5 Kt ( u )(da) for some measurable function n t from U t to A. These 
two theorems are proved in Section [7J 

Theorem 5 (X t never has an atom). 

Let X = (X t )t>o with X t = {U t ,r t ,^t) be the tree-valued Fleming-Viot process. Then, 



Remark 3.13 (Interpretation and idea of the proof). 

1. At first glance the fact that [i t is non-atomic for all i > might seem to con- 
tradict the fact that the measure-valued Fleming-Viot diffusion is purely atomic 
for every t > 0. However, both properties are of different kind. The above the- 
orem implies that randomly sampled individuals from the tree-valued Fleming- 
Viot process have distance of order 1, whereas genealogically the atomicity of the 
measure-valued Fleming-Viot diffusion expresses the fact that at every time t > 
one can cover the state with a finite number of balls with radius It. 

2. The proof is based on the simple observation: for a measure fj, € M\(E), 



Hence, the proof of J3.28D is based on a detailed analysis of the Laplace transform 
of the distance of two points, independently sampled with distribution \i t . 

The next goal is to establish that at any time there is a mark function. Briefly, the 
state (U,r,fx) of a tree-valued population dynamics admits a mark function k iff every 
individual u e U can be assigned a (unique) type k(u) e A. This situation occurs 
in particular in finite population models, e.g. in the Moran model. The question for 
the tree-valued Fleming-Viot model is whether types in the finite Moran model can 
change at a fast enough scale so that an individual can have several types in the large 
population limit. Such a situation can occur, if the cloud of very close relatives (as 
measured in the metric r) is not close in location (as measured in the type space A). 



P(/U t has no atoms for all t > 0) = 1. 



(3.28) 



H has no atom 




(3.29) 
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Definition 3.14 (Mark function). 

We say that (U, r, fi) G Va admits a mark function if there is a measurable function 
K : U — > A such that for a random pair (it, 21) with values inU x A and distribution fj, 

k(U) = 21 ^-almost surely. (3.30) 

Equivalently (U, r, fj,) G admits a mark function if there isv<E M\{U) with 

fi(du, da) = v(du) ® S K ( u )(da). (3.31) 

We set 

Ujmark ._ SJFJ~^ e Ua . ^ r ^ ^ admits a mar fe function}. (3.32) 

Remark 3.15 (mmm-spaces admitting a mark function are well-defined). 

Let us note that admitting a mark function is a property of an equivalence class. Assume 
(£/, r, /z) = (U'^r'^n') G Va (with an isometry ip : U' U as in I I2.2H ), where (U,r,/i) 
admits a mark function n : U A, i.e. d3 .3 It holds. Then, clearly for n! :~ noip we have 

/i'(du,da) — (ip, id)*/x(ciw, da) = (<p, id)*^(dw) <8> 5 K ( u )(da) = ip*v{dv) ® S K ^ v ^(da). 

(3.33) 

In other words, (U',r',fi') admits the mark function k' = k a (p. 
Theorem 6 (X t admits a mark function for all t). 

Let X — (X t )t>o, X t = (Ut,rt,fj,t) be the tree-valued Fleming-Viot-dynamics. Then, 

P(X t G U^ ari for all t > 0) = 1. (3.34) 
Remark 3.16 (Mark functions and the lookdown process). 

For a series of exchangeable population models it is possible to construct the state of 
an infinite population via the lookdown construction [DK96] IDK99B . This construction 
immediately allows to define a mark function on a countable number of individuals 
specifying their types at all times, which suggests that H3.34D should hold. However, the 
metric space read off from the lookdown process is not complete, and the mark function 
is not continuous. Therefore, there is no way to extend the mark function given through 
the lookdown process to the completion of the corresponding metric space. Instead, 
our proof of Theorem [6] in Section 17.21 uses again martingale arguments and moment 
computations. 

3.4 Strategy of proofs 

The proofs of our results are of two types. On the one hand, the proofs of Propo- 
sitions 13.11 and 13.21 Theorems [T] and [2] use as the basic tools the fine properties of 
coalescent times in Kingman's coalescent. This means they are carried out without 
specific martingale properties of the tree-valued Fleming-Viot process. On the other 
hand, Propositions 13. 3l and l3 .61 Theorems[3j[4l[5]and[6]are proved by calculating expec- 
tations (moments) of polynomials, which is possible by using the martingale problem 
for the tree-valued Fleming-Viot process. The polynomials we have to consider here 
(see also Remark [2.5D are either <f x x 2 or <I> 12 , i.e. polynomials based on the test functions 
^fca) = exp(— Ar i2 ) or tp(r,a) = exp(— Ari 2 )l{ ai=a2 } and products, powers and linear 
combinations thereof. For the calculations of the moments of this type we develop some 
methodology which we explain in Section [5j 

Propositions 13.11 and 13.21 Theorems [T] and [2] are proved in Section [4] while Proposi- 
tions [373] and [3T6J Theorems [3] and [4] are proved in Section[6] The latter results are then 
used to prove Theorems [5] and [6] in Section [7J 
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4 Proof of Propositions 13.11 and 13.21 and of Theorems [T] and [2] 

4.1 Preparation: times in the Kingman coalescent 

Recall that T n = Sn+i/i™^ 1 ) + 5„ +2 /("^ 2 ) H is the time the Kingman coalescent 

needs to go down to n lines, where 5*2, S3, . . . are i.i.d. exponential random variables 
with rate 1. Before we begin, we prove some simple results on the times T n . 

Lemma 4.1 (Moments and exponential moments of T n ). 

Let T n be the time the Kingman coalescent needs to go from infinitely many to n lines. 
Then, 

E[T„] = -, 
n 

E[(T n -2/n) 2 ] =JL(i + £>(!/„)), 
1 fi 

E[(T„-2/n) 3 ] = — (l + 0(l/n)), 

1 ft 

E[(T„ - 2/n) 4 ] = —(1 + 0(l/n)), (4.1) 
fi4 

E[(T„-2/n) 6 ] =_(l + 0(l/„)), 
E[(T„-2/n) 8 ] = -iL(l + 0(l/„)), 
E[e- AT »]<e-t^ a r) J A>0. 



Proof. Recall that E[(Sj - l) fe ] = fc! E!Lo( _1 )V* ! - w e start by writing 

r , ^ E [5i] ^ 2 ~ 1 1 2 

= E (|) = E ^ = 2 £ ^i-7 = n- (4 - 2) 

j=n+l \2/ i=ra+l v ' i=n+l 

Next, 



E[(T n - 2/n) 2 ] = Var[T„] = £ gg§ = 4 £ 



j2(i-l)2 Z_. i2( i _ 1 )2 

i=n+l v y i=n+l v ' 

- 4^°° ijdx + G(l/n 4 ) = JL (i + 0(l/ n )). 
For third moments, using E[(Sj — l) 3 ] = 2 

E[(T„-2/n) 3 ]=E[( £ _2 (fl ,_ 1) N« 
i=n+l ^ ' 



£ ^s^s^-i) 

i=n+l v ; 

ijL(l + 0(l/„)), 



(4.3) 



2 3 (4.4) 
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For fourth moments 
E 



[(T rl -2/n) 4 ] = E[( E x -(— T)^- 1 ) 



i— n+ 



2= n+ 



r E[(S, - l) 2 ] • E[(5j - 1) 



^4 4 

J^ +1 i 2 (i-l) 2 J 2 0'-1) : 



( £ + 0(l/n 7 ) 



(4.5) 



2— n+ 

16 

9n6 



(l + 0(l/n)) 



For sixth moments, 

E[(T„ - 2/n 



6 1 = E 



Z— Ti + 1 

OO 



( £ 1^7— ip E [^ - 1) 2 ]) +0(1/- 10 ) 

i=n+l V ' 

64 



(4.6) 



27n 9 



(l + 0(l/n)). 



With analogous calculations, the results for the 8th moment follows. 
Finally for the exponential moments, we compute for any A > 



— n+l \2J 



x 



i—n-\- 



i=n+l V27 



<cx P (- e 
< cxp ( - e 



i— n+l 

< exp 



OO 

E 



i=(n+l)V fvSX+ll 



exp 



A 

(*) +A 

4 A 
3 nV \VAX 



(4.7) 



4, A. VX, 



□ 



4.2 Proof of Proposition [3TT1 

Let T„ be as in the last subsection. Then, I l3.ll ! is equivalent to 



„ n— s-oo „ 

nl n > 2. 



In order to see this, note that Nt„ = n by definition and 

{T n > e} = {N e > n}. 



(4.8) 



(4.9) 



Since T„ i as n — > oo (and iV e f oo as e — > 0), the equivalence of J3. ID and ( 14.81 1 follows. 
For J4.8D , it suffices to note that 



(4.10) 



by Lemma 14711 Since the right hand side is summable, limsup„_ J . 00 \nT n — 2\ < e almost 
surely for all e > 0. In other words, nT n — » 2 almost surely. 
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4.3 Proof of Proposition I3T21 

By the Lindeberg-Feller central limit theorem, we see from the moment computa- 
tions of Lemma 14 . 1 1 that 

T n 2 1 TL n - 



V4/P) 



» Z. 



Recalling ( 14.91 1, we set 

a £ (x) := 

such that for every 

N £ - 2/e 

VVW) 



(4.11) 



(4.12) 



>x) = V{T aAx) >e) 



'Ta E ( x ) - 2/a £ (x) e-2/a e (x) 



> 



V ^4/(3^(1)3) y4/(3o e (x) 3 ) 
^ P(Z > x), 



(4.13) 



which implies 03 .2D . 

However, we also need to show convergence of moments up to fourth order. We 
write 

r °° '\N e -2/e 



wN £ -2/e\^ 




\\y/2/(3e)J - 


Jo 


uN e -2/ey- 


= 4 / 




Jo 



(^=M >x ) 



N £ - 2/e 



(-^7=^ >*) 



> xjdx, 
> x ] dx. 



(4.14) 
(4.15) 



To estimate the right hand side of H4.15D we first show that for a suitably chosen 6 > 
and \/6 — S < c < \/6 we have 



N £ - 2/e 



<-x\ = 



6/e 



/v /i " " ^ ^27(37) 



a; 3 P 



< -x ^> 0, 



(4.16) 



where the equality follows because for x > y/6/e the integrand is identically 0. Using 
the exponential Chebyshev inequality we obtain for all \ y . £ > 



x 3 F[ -. 1 <-x) dx = 



/6/e 



c/Vi 



'(N £ < * - xy / 2/{3efj dx 



y 3 



2-y 



< 



C-y/2/3 

V 3 

/2J3 e 2 



9 



^ F[Ne < [ _JL 1)dy=1 I _^ ¥ l T[2 _^ ] < £ )dy 



Cy/2/3 



now taking the lower bound for c, setting 5' = 25/3 and using I I4.7D we get 



< 



< 



2 



-Xy iB T 2-y , 



9 f ^ r 4/ 



dij 

4 / A,, f e X /A 



2-y 



(4.17) 
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Now choose X y<e = and let §/ = 0.39 be the solution of ^A^S 7 = 1.9. For 

y e (1.9, 2] we have 

\,ee ^\7e _ (2 - y + 1/2) 2 2 - y + 1/2 2 - y + 1/2 



2-„ A 2 = e(2-y) A ^^^— (4 - 18) 
Thus, 

4/A,, e e a/ A.„ Z \ 1 / 55 13 7 \ , „ „„, 

- 3 A 2 ) = - 7 ( - 12 + y y ~ y ) • (4 - 19) 

It is easy to see that on the interval [1.9, 2] the function y i-> (— 1| + ^py — y 2 ) is bounded 
below by a = 0.04 (its value in 1.9). It follows 

j 00 x z P ( N z - 2 / £ < - x ) dx < c i±e- a ^ ^ (4.20) 

for a suitable constant d > 0, and hence we have shown H4.16D . 

In order to show convergence of fourth (and second) moments, using ( 14.151 1, since 



> x) > ~P(\Z\ > x) pointwise, we need to show that there is an inte- 

VvWi ) ' v 

grable function dominating 



( N * ~ 2 / £ . 1 , T, ( N e ~ 2 / 



e 



x A P ' >x \ + P\ -. < -x l/ a<c /^5i , (4.21) 

for some c > and x > 0. For this, we get, by the Markov ineguality and I I4.6D 

N e - 2/e 

= p(T ae(x) -2/a e (x) >e-2/o e (x)) + p(t o , ( _ s) - 2/a e (-z) < e - 2/o B (-aO) 
< E[(T . (X) - 2/a £ ( a; )) 6 ] ( E[(T ae( _ x) - 2/a £ (-x)) 6 ] 



> x 



{e-2/a s {x)f ( e -2/a e {-x)f 

64(l + 0(l/a e (;r)) 64(l + 0(l/a B (ar)) 



27a e (x) 9 (xy/ey6(l + 0{^))f 27a £ (~x) 6 (~x^/6(l + 0{y/I))f 
2(l + 0(l/g e (x)) 2 / e 

a*(l + O(Ve)) " z 6 (l + O(e)) V x 6 (1 + x ^Jq ) 



since 



a e (x) ^+xy^2/(^) + 0{l) 2i + x ^Zj6 + 0{e)' 



(4.22) 



(4.23) 



Since the area x < is restricted to x < c/^/e in d4.2 ID . the 0(-)-term on the right 
hand side of H4.22I I does not have a pole. It is now easy to obtain an integrable function 
dominating ( 14.21H , leading to 03 . 3D . 

4.4 Proof of Theorem [1] 

By Proposition 12.111 5. (see BDGP12I Thm. 2] for details) the tree-valued Fleming- 
Viot process with selection has a law which is absolutely continuous with respect to 
the neutral process. Therefore it suffices to consider the neutral case, a = 0. We 
observe that for a = the claim is not affected by mutation. Moreover, it suffices to 
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deal with the case X a = . The reason is that ( 13.171 1 is equivalent to the assertion that 
lim e _j. = 2 for all t > 5 holds for all S > 0, and iVj is independent of X for e < t. 
Let 

T* :— inf{e > : (U t ,r t ) can be covered by n balls of radius 2e}, (4.24) 

i.e. T£ is the minimal time we have to go back from time t such that we have n ancestral 
lineages. It suffices to show (see around I I4.8D ) that P(nTf t "^°°> 2 for all t > 0) = 1. 
To prove this, we need to extend the proof of Proposition 13.11 It suffices to show that 
nTl -> 2 uniformly for < t < 1 (if X = X^). 
First, by Lemma [4~T1 for all i > we have 



E[(T^-E[T*]) 8 ] <— . (4.25) 
Therefore considering the process along a discrete grid, we have for any e > 0, 



fc=0, 



sup \nT k J^ - 2| > e ) < J2 P(l«^/" 2 - 2| > e) 



k=0 



(4.26) 



^ 2 E[(r°- E[ r°]) 8 ] < i 

~ (e/n) 8 ~ e 8 n 



Since the right hand side is summable, sup fc=1 n a \nTn^ n — 2| ,w °°> almost surely 
by the Borel-Cantelli lemma. Now we need this in continuous time and derive for the 
difference (nTn 71 — 2) bounds from above and from below. 

First observe that {T^ > e} C {T£ > e — (t — s)} for t — e < s < t for tree-valued 
Fleming-Viot process. (This property holds in every population model, arising as a 
diffusion limit from an individual based population where we can define ancestors, since 
the ancestors at time t — e of the population at time t must then be ancestors at time 
t — e = s — e + (t — s) of the population at time t — e < s < t.) We can now write, for e > 
and n > 2/e: 



sup nTl > 2 + e] < Pf sup 7;L*" 2 J/™ 2 > 



2 + e — (t— jVj 



o<t<i / v o<t<i n 

<P( sup T^ 2 > 2 -±^-\) (4.27) 

<p( su P r^>i±f/?)<^. 

Hence lim sup^^ sup 0<t<1 nT,' < 2 almost surely, by the Borel-Cantelli lemma. 

For the other direction of the inequality we use {T* < e} C {T* < e + s — i} for s >t, 
to get 

' sup nTl < 2 - e) < p( sup < £^£ + JZLL _ t 

0<t<l / ^0<t<l 

<p( sup T n fe /« 2 <±^ + 4) (4.28) 
<P( sup T^/" 2 < 

^ fc=0,...,n 2 

Hence liminf n ^oo sup 0<t<1 nT^ > 2 almost surely. 

Combining both, the estimate from above and from below we obtain the assertion of 
Theorem [1] 



2-£ 


[tn 2 ] 


h 

n 


9 


2-e 




h 

n 


^) 


2-E/2- 


)< 1 


71 
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4.5 Proof of Theorem [2] 

We proceed in the following four steps. 



• Step 0: Warm up; computation of the first two moments of B E (t) — B e (s). 

• Step 1: Computation of the first two conditional moments of B e (t) — B s (s). 

• Step 2: The family (B £ ) £>Q is tight in Cr([0, oo)). 

• Step 3: For (B(t)) t >o a limit point (B(t)) t >o as well as (B(t) 2 — t)t>o are martin- 
gales. 

Throughout we let (J*)t>o be the canonical filtration of X = (X t )t>o- 



Step 0: Computation of first two moments of B e (t) — B B (s). The first moment of 
B E (t) — B e {s) eguals since by assumption the tree-valued Fleming-Viot process is 
in equilibrium. 

For the second moment, we start by noting that (see Proposition [372) 

N s e = - + J^-Z s + o(^=) (4.29) 

for some random variable Z s ~ iV(0, 1). Note that N* and N$ are independent given 
(t — 5,t]r\(s — e, s] =0. (The reason is that in this case N E depends on resampling events 
in the time interval (s — e, s] while Ng only depends on resampling events in (t — S,t], 
and these two sets of events are independent.) Without loss of generality we set s = 
and compute the variance of B e (t) as 

Var[B s (t)] = 3 f [ Cov(iV;, N°)drds = 3 f [ Cov(N^N^)drds 

JO JO Jo Jov{s-e) 

ft feM re 

= 3 / / Cov{N°- S ,N°)d6ds 6t ■ / Cov(iV e , N*)d6. 
Jo Jo Jo 



(4.30) 



In order to compute the integrand of the last expression, we decompose N° = N®_ s — 
(N®_ s — N®). Now N®_ s — N® is independent of N E . The former is the number of lines 
the tree at time looses between times e — 5 and e in the past, and therefore only 
depends on resampling events between times — e and S — e. The latter only depends on 
resampling events between times 5 — e and 5. Hence, 

Cov(iV e , N s £ ) = Cov(iV e _ 5 , N s £ ). (4.31) 

Consider now the dual representation of our equilibrium by the Kingman coalescent. 
Let Kf be number of lines of a subtree, starting with N lines, in a tree starting with oo 
many lines, at the time the big tree has i lines. In OPWWlll Lemma 4.1] it is shown that 

E i K "i - A^b' < 4 - 32 > 
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Hence, for independent Z, Z' ~ N(0, 1), 
Cov(N°,N s £ ) = Cov(N°_ s ,N*) 

= Cov(N°_ s ,E[N*\Ni,N2_ s }) 



= Cov(N»_ s ,E[Ks .\*i>W-s\) 
= Cov(iV°_ 5 , 



N°_ s + N§-1 



Cov|v/2/(3(£-i))Z + o(l/ V?), 

(i^ + a/2/(3( £ -<y))Z + (y^))(| + y^flZ' + o( )) 
S + 7 + \/27(3(F^))Z + v 7 ^)^' + o(l/^") 



-Cov V2/(3(e-*))Z + °(l/V£), 



(l + v/(e-(5)/6Z + o(Ve)) (l + v^Z' + °(v^))) 
1 + | V(e - <5)/6)Z + y/Sj&Z' + o(^) 
Cav(y/2/(Z(e -8))Z + o(l/VS), 



((l + v/(e - <J)/6Z + + o(Ve))) 



l--y/{e-6)/6)Z 

£ 

e-S 



G/EZ' + o(y/e) 



-Cov y/2/(3(e -S))Z + o(l/y/e), y/{e-6)/6Z + -y/S/6Z' + o(y/e) 



3e 2 



leading to 



Var[B £ (t)] =3t f Cov(7V £ °, N*)d5 e «° 3i / ^d<5 = t. 



(4.33) 



(4.34) 



Step 1: Computation of first two conditional moments of B E (t) — B £ (s). We can compute 
the first conditional moment as 



E[B s (t) ~ B e (s)\ T s ] = 



E[Nl-E[N™]\F s }dr 



s+e 



E[N r e -E[N™]\F s ] dr 

(4.35) 

since we started in equilibrium and iVJ is independent of T a for r > s + e. So, by 
Proposition 13. 21 



E[(E[B 8 (t) - B e (s)\F s ]Y] < e 2 E[(N™ - E[JV»]) 2 ] ^ 0, 



which implies 



E[B £ (i)-S e (s)|J- s ] 



^ in L 



(4.36) 



(4.37) 



For the second conditional moment, we extend our calculation from Step 0. Here, 



E{(B e (t) ~ B e (s)) 2 \T s ] = 3 



E[(Np - E[N™]){Np - E[iY £ °°])| J" s ]dridr 2 



s J s 

t 



3 f f 1 Cov [iV £ ri , AT 2 ] dn rfr 2 + 3 A £ , 

■/s+e J s+e 



(4.38) 
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with 

n\A e \]=E 

= E 



t ,nA s+e ^ E[N oo ]){N p _ E [N™])\T s ]d ri dr 2 

s J s 
s+2e /-riA(s+e) 



E[|(JVp - E[JV e °°])(JVT - E[N™])\F s ]dndr, 



-s+2e /-riA(s+e) 

EfKiVj 1 - V[N™])(Np - E[iV £ °°])|]dridr2 
" / 1 E[\(Np - E[N™])(Np ~ E[N™])\]dndr 2 



< 2e 2 Var[N~ 



> 



by Proposition [372] So, combining the last two displays with J4.34D , 

E[(B e (t) -B E {s)f\F s ] ^t-s inL 1 . 



(4.39) 



(4.40) 



Step 2: The family of the laws of (B £ ) £>0 on Cjr([0, 00)) is tight for e — > 0. We use the 
Kolmogorov-Chentsov criterion; see e.g. llKal021 Corollary 14.9]. We bound the fourth 
moment of the increment in B e . We write a £ := EfTV,? ] such that, again making use 
of the independence of and TV^ 1 if |s 2 — si| > e, as well as of Proposition 13.21 we 
estimate for fixed t and e — > 



ds4 ds3 ds2 ds\ 



(4.41) 



ds4 ds3 ds2 dsi 



ft rS! ps 2 />S 3 

E[(B £ (i)) 4 ] < I 

/o Jo Jo Jo 

/ / 1 E[(iV £ Sl _ a e ) • (JV« - a s )]d S2 d Sl N ' 

'0 JOV(si-e) 

f* /-Sl /-S2 /-S3 4 

/ ElL-nW-^ 

/0 JOV(si-e) J0V(s 2 -e) J0V(s 3 -s) L 4-1 

< (t(t A £)Var[^ e °°]) 2 + t(t A e) 3 E[(A^ - a e ) 4 ] 
<t 2 

and the tightness follows. 



Step 3: If (B t )t>o is a limit point, then (B t )t>o as well as (B 2 — i)t>o are martingales. 
Let = (B t ) t >o be a weak limit point of {(l? e (i)) t >o : £ > 0}, which has continuous 
paths by Step 2. We know from Step that B t is square integrable which allows for the 
following calculations. For < r\ < ■ ■ ■ < r„ < s and some continuous bounded function 
/ : R" — > R, by (£757) 

|E[(S t - B a ) ■ f(B ri ,. . .,B rn )]\ = lim |E[(S e (t) - B £ (s)) ■ f(B e (n), B £ (r n ))}\ 



< lim E 



E[B e (t) - B £ (s)\T s ] ■ f(B e (n), . . . , B e {r n )) 



0. 



Since n, . . . ,r n and / were arbitrary B is a martingale. Similarly by d4.40D , 

|E[((B t - B s ) 2 - (t - a)) ■ f(B ri ,...,B r J}\ 

= lim |E[((B e (t)-B e (s)) 2 -(t-«))./(fl e (n),...,B 8 (r n ))]| 

< lim e[ V[(B £ (t) - B £ (s)) 2 - (t - s)\T s ] ■ f(B £ ( ri ),...,B e (r n )) = 

s—too L 



(4.42) 



(4.43) 



0. 



which shows that {B 2 — t)t>o is a martingale. Then by Levy's characterization of Brow- 
nian motion, B is a Brownian motion. 
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5 Preparation: Computing moments of *&\ 2 and Sf^ 2 

The key to the proofs of Proposition 13.31 and Theorems [3j [4j [5] and [6] is a proper 
understanding of the behavior of the functions ^ 12 (X t ) and ^ 12 (X t ) from Remark 1 2. 5 1 
along the paths of the tree-valued Fleming- Viot dynamics. In this section, we provide 
useful tools for the analysis of these functions. In particular, we are going to compute 
moments up to fourth order. 

Throughout this section we assume that 

X = (X t )t>o is the neutral tree-valued Fleming-Viot eguilibrium process, (5.1) 

i.e. with X = X^, and denote by (J- t )t>o the canonical filtration of X. Furthermore, 
for the mutation operator, we will assume throughout this section that 

/3(u, •) is non-atomic for all u e A. (5.2) 

This assumption implies that every mutation event leads to a new type. In particular, 
this will be crucial in Lemma [5T8l 

5.1 Results on moments of ^ 12 (X t ), ^ 12 (X t ) and its increments 

The key is to obtain the power at which ty 12 vanishes asA^co respectively at which 
order t i-> ((A + l)^]^(X t ) — 1) vanishes as t — > 0. Proofs of the next two lemmata are 
given in Section [5T3] 

Lemma 5.1 (Convergence for fixed times). 

For all t > 0, 

(A + l)^ t )-^l, 



(\ + 2V + l)*\ 2 (X t )^l 



(5.3) 



in 1/ (and therefore also in probability). 

Lemma 5.2 (Second moment of increments of t i-» ((A + l)ty\ 2 (X t )). 
There exists C > 0, which is independent of A, such that 

sup(A + l) 2 E[(*! 2 (AT t ) - *f(X )) 2 ] < Ct, (5.4) 

A>0 

sup(A + 2i? + l) 2 E[($i 2 (^) - $]?(X )) 2 } < Ct. (5.5) 

A>0 

Remark 5.3 (Tightness). 

If the right hand side of H5.4D would have been Ct 1+e for some e > 0, then Lemma [5T2l 
would imply the needed tightness for Theorem[3]by the Kolmogorov-Chentsov tightness 
criterion; see e.g. BKal02l Corollary 14.9]. However, since it is only linear in t, we will 
have to use fourth moments to get the desired tightness property. 

The main goal of this subsection is therefore to compute the fourth moments of the 
increments of t ^ (A + l)^ 12 (X t ) and of 1 1-> (A + 2d + l)$ 12 (X t ). The proof of the next 
two results are given in Sections 15. 51 and l5 .61 

Lemma 5.4 (A fourth moment of X^). 

Let be the Kingman measure tree. Then, as A — > oo, 

E[((A + 1)*^) - l) 4 ] = + (J_) , ( 5 .6) 
E[((A + 2tf + l)*i 2 (Ar oo )-l) 4 ] = A + ( 5. 7) 
In particular, the convergences in 1 15.31 ) hold in L 4 as well. 
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Lemma 5.5 (Fourth moment of increment of 1 1-> ((A + l)*] v 2 (X t )). 
There exists C > 0, such that 

sup(A + l) 4 E[(*i 2 (X t ) - f i 2 (X )) 4 ] < Ct\ 

A>0 

sup(A + 2d + l) 4 E[($i 2 (X t ) - $I 2 (X )) 4 ] < Ct 2 . 

A>0 

We start with second moments and afterwards, we provide an automated procedure 
how to compute higher order moments using Mathematica. 



(5.8) 



5.2 Moments up to second order 

Here, we show that (A + l)*i 2 1 and (A + 2i? + l)§i 2 ^=^> 1 in I? (Lemma|5~lTl 

and study the second moments of the increments of 1 1-> ((X+l)^ 12 (X t ) — 1) (Lemma l5.2l l. 
We start by defining some functions which appear in the first and second moment equa- 
tions for the polynomials ty 12 and ^ 2 defined in Example 12 .51 

Definition 5.6 (Some functions on U). 
We define the following functions in U 1 : 



12,12 
A 

12,23 
A 

12,34 
A 

$0 



12,12 
A 

12,23 
A 

12,34 
A 



(U,r,n) 



(U,r,fi) 



®N g — \(r(u 1 ,u 2 )+r(u 3 ,u 4l )) 



(m 1 
1, 

$i 2 ((!7^)) 



-A(l-(tti,M 2 )+r(ll2,U3))\ 



1/ , e -M«i.«2)\ 

- u -{ai=a 2 } t ' /i 



Tl. . Tl . . p-Kr(u.l,u.2)+r(u 2 ,u 3 ))\ 

-"-{01=02} -"-{02=03} / 



{ai=a 2 } l{a 3 =a 4 }£ 



and centering around the equilibrium expectations 

1 



- A(r(ui ,^2)+^(^3 ,^4)) 



-v-12 
1 A 

12,12 
A 

12,23 



*A 2 ' 

-,-12 
1 2Aj 



A + l 
1 



^12,23 _ ^12 _ 

A 2 2A A + 2 



A + 6 



T 12.34 
1 A 


:= *f' 34 


^12 
1 A 


:= *i 2 _ 


CU2.12 
1 A 


X 12 
— i 2Ai 


12.23 

L x 


:= %\ 2 ' 23 



4 12,23 4 ^12 

3 15 
1 



2A 



2(A + 2)(2A + 3)' 
3(A + 5) A ~ 15(A + 3)(A + 5)' 



15 



+ 2tf- 
1 

2 + i? 



A + 2i? + 2 



+ 



A + 3i9 + 6 



(?? + 2)(A + $ + 2)(2A + M + 3) ' 



T 



12.34 



:= * 



12,34 



12,23 



2(1-*) 



(5.9) 



(5.10) 



r (tf + 3)(2tf + 5) 2A (3 + tf)(A + 2tf + 5) 
2A + j? - 2i9 2 + 15 
~ (1? + 3)(2i? + 5)(A + 2i9 + 3)(A + 2tf + 5) ' 
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Remark 5.7 (Connections and a first moment). 

Our main object of study are the functions ty 12 and <I' A 2 . Note that since X a has the same 
law as and since in the time it takes that two randomly chosen lines coalesce is 
an exponential random variable R with parameter 1, we have 

m\ 2 (Uo)} = E[{fi m ,e- Xr ^^)} = E[e- Ai? ] = — J— . (5.11) 

A + 1 

Moreover, if mutations arise at constant rate d/2 > 0, consider again two randomly 
chosen lines which coalesce at time R, and the first mutation event at an independent, 
exponentially distributed time S with rate using ( 15.21 : 

E[§ A 2 (W )] = E[( M ^ N , l {ai=a2}e -^— *>)] = E[e- XR MR<s}} 

= E\e- XR ■ P(S > R\R)} = E[e- (x+m ] = - \ . (5 ' 12) 

l L J A+tf+1 

Moreover, we write 

(5.13) 

and analogously for ^^ 2 ' 34 . 

Next we compute the action of the generator of X on the functions from Defini- 
tion [5T6l 



Lemma 5.8 (ft* A and OT A ). 

1. Let fl be the generator of the tree-valued Fleming-Viot dynamics. Then 



rw\ 2 = 


-(A + 


1)1- A 2 + 1 




n*\ 2 - 23 = 


-(2A- 


f 3)* A 2 ' 23 - 


1- 2* A 2 + * 


m>\ 2 ^ = 


-(2A- 


f 6)* A 2 ' 34 - 


!_ 4 ^12,23 + 


nr\ 2 = 


-(A + 


1)TI 2 , 




nr 12 ' 23 = 


-(2A- 








-(2A- 


f 6)T A 2 - 34 , 





12 
2A- 

2* 12 



A i 



(5.14) 



(5.15) 



and 

ft$ A 2 = -(A + 2-0 + 1)$ A 2 + 1 • # A , 
fi$f' 23 = -(2A + 3tf + 3)§ A 2 ' 23 + 4§ A 2 , 

12,34 = _(2A + 4^ + 6)$ A 2 ' 34 + 4§ A 2 ' 23 + 2§ A 2 , 
nf\ 2 = -(A + i5» + 1)T A 2 , 
QT A 2 ' 23 = -(2A + 3tf + 3)T A 2 ' 23 , 
12,34 = _^ 2A + M + 6 )fi 2 - 34 . 

2. The following processes of evaluations of functionals of X = (X t )t>o are martingales: 

(e (X + l) tT 12 (Xt)) ^ o) (efWT^))^ (e( 2 ^) tT 12,34 (Xt))t ^ ; (5 16) 

and 

(e (A+M+1)t Ti a (X t )) t > 0) (e^+^+^Tf 23 (X t )) t > , (e^+^'Tf 34 ^))^. (5.17) 
Furthermore for all s > and a]J G {12; 12, 23; 12, 34} 

E[T k x (X s )} = E[f k x (X s )] = 0. (5.18) 
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Remark 5.9 (Equilibrium expectations of vf^). 

Since E[T^(Xoo)] = Epf^X*,)] = by ( 15.181 ), it is also possible to compute the equilib- 
rium values for vlj* and ^ We write for later reference, 

^12,23 _ ^,12,23 5A + 3 



A — * A 



(A + l)(2A+l)(2A + 3) 



(iff? 



2A + 17 A + 2V^ A A + l 

T 12,34 = ^12,34 4A 2 + 18A + 9 

A A (A + l)(A + 3)(2A+l)(2A + 3) 1 

_ 4 f^i2,23 5A + 3 

3V A (A + l)(2A + l)(2A + 3) 

1 ^-^-U^-^f* 



15 v 2A + 1 / 3(A + 5) V A + l 



and 



^12,23 _ $12,23 5A + 6l9 + 3 



(A + 2d + 1)(2A + 2d + 1)(2A + 3i? + 3) 



1 ^*i2 A ^ Ar;i2 



t9 + 2V^ 2A 2A + 2i? + l/ A + ?? + 2V A A + 2i? + 1 

T 1Z,34 = 



U2.34 -f-,12,34 4A2 + 2A(5tf + 9) + 2W + 6d 2 + 9 



(A + 2d + 1)(A + 2tf + 3)(2A + 2d + 1)(2A + 3tf + 3) 
mo 23 5A + 6i? + 3 



— — «\ 

3 + tfV A (A + 2tf + l)(2A + 2tf+l)(2A + 3tf + 3) 
4 



(5.20) 



(t? + 3)(2^ + 5) V ^ 2A 2A + 2i? + l 

2(1-*) r$i2_ 1 



(i9 + 3)(A + 2^ + 5) V A + 20 + L 

where the terms after each vf^ and $| are the corresponding equilibrium values. 
5 . 3 Proofs of Lemmata [57T1 15T21 and HT8l 

Proof of Lemma \5.8\ The results in ( 15.14b and d5 . 1 5D follow by simple calculations. The 
stated martingale properties are consequences of general theory; see Lemma 4.3.2 
in HEK86I . In particular, since the equilibrium distribution of X is ergodic, for k e 

{12; 12, 23; 12, 34} and A 12 = A + 1, Ai 2 , 23 = 2A + 3, A 12 , 34 = 2A + 6, 

E[T*(X S )] = Urn E[?t(X t )] = lim e- x ^-^E[T k x (X s )} = ■ E[T^(X S )], (5.21) 

t— ^oo t— yoo 

which can only hold (note that |E[T^(X S )]| < oo by definition) if E[T h x (X s )] = 0. Re- 
placing A fc by A fc , A12 = A + 2$ + 1, Ai 2 ,23 = 2A + 3t? + 3, A i2 ,34 = 2A + 4t? + 6 gives the 
corresponding results for the expectations of Y*. □ 

Proof of Lemma\5A\ We only give the full proof for the first assertion in d 5 . 3 D since 
the second follows exactly along the same lines. The proof amounts to computing the 
variance of *S>\ 2 . Therefore, we need to understand the expectation of ( v &^ 2 ) 2 = <1>^ 2 ' 34 
(for the last equality see I I2.6D ). To this end we express the ^\'s in term of the Ta's and 



29 



obtain 



= Ti a 



12,23 ^12,23 



T 



A + r 

1 -y-12 

+ 2 1a 



^12 
" 1 A 



5A + 3 



2 + A (A + l)(2A + l)(2A + 3) : 



12,34 -v-12,34 



= T\ 



3 T * 



12,23 



5 T * 



12,12 



2(A + 6) 



T12 
A 



(5.22) 



(A + 2)(A + 5) 

4A 2 + 18A + 9 

(A + l)(A + 3)(2A + l)(2A + 3)' 



Together with H5.18D , this implies 



E[((A + l)* 12 (X t ) - l) 2 ] = (A + l) 2 • E[*f ' 34 (X t )} - 2(A + 1) • E[*i 2 (X t )] + 1 



2A 2 



A— >oo 



(A + 3)(2A + l)(2A + 3) 



> 0. 



(5.23) 



□ 



Proof of Lemma [5.2\ Again, we restrict our proof to ( 15.41 1 since ( 15.51 1 is proved analo- 
gously. We compute 



1 

aTi 

2A 2 



(5.24) 



(A + l) 2 (A + 3)(2A + l)(2A + 3) 

and (recall that we start in equilibrium) 

E[(^ 2 (X t )-^ 2 (X )) 2 } 

= E[(* A 2 (AT t )) 2 - (* A 2 (X )) 2 - 2^ 2 (X )^\ 2 (X t ) - ^ 2 (X ))} 



E 



2*| 2 (x ) / nri^x^ds 



f E{2*{ 2 (X )(\ + l)E[Ti 2 (X s )\T Q }}ds 
Jo 

2 / (X + l)e-^ +1 >d S --E^ 12 (X Q )T 12 (Xo)] 



(5.25) 



< 



(A + l) 2 
and the result follows. 



t 



□ 



5.4 Moments up to fourth order 

For the proofs of Lemma 15.41 and Lemma 15.51 we use moment calculations of the 
increments of (^ 12 (X t ))t>o and (\I> A 2 (X t )) t >o up to fourth order. The calculations are 
presented in an algorithmic way such that higher moments can be computed along the 
same lines. The fundamental idea is that all computations that need to be done are 
linear maps on the vector spaces: 

V A := span({1>5 : (U, r, /i) i-> (^ m , U e - Ar («i.«jf)) : I finite subset of N 2 }), 

V A := span({$f : {U,r,fi) ^ (fi m , J] ^{ ai = aj }e~ Xr{Ui ' Uj) ) : I finite subset of N 2 }) . 

(5.26) 
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We point out that one advantage of using matrix algebra is that it is possible to use 
computer algebra software such as Mathematica for automated computations. 

Apparently, a basis of the vector spaces V A and V A are given by (recall from Defini- 
tion [5T6} 

2 A :=(* A ,^ 2 ,*f 12 ,*f 23 ) *f 34 ,...), (5.27) 
$ A := ($ A , § A 2 , $ A 2 ' 12 , § A 2 < 23 , $ A 2 ' 34 , . . . ), (5.28) 

respectively. By inspection of e.g. Lemma [5T8l it is clear, that the generator gives rise 
to linear operators V A — > V A and V A — > Y\. The bases J> A and vj/ A can be ordered such 
that the matrices representing the linear maps £1 on V A and V A are upper triangular; 
see also Lemma [5T8l 

In the next section, we give the action of f2 on the first 36 basis vectors for each 
basis. The last basis vectors in these collections are given by 

* A 2 ' 34 ' 56 ' 78 : {U,r,ft) h-> (^ N , e - Xr ^™-™), (5.29) 

where we write ri 2 , 3 4,56,78 := r(u 1 ,u 2 ) + r(u 3 ,« 4 ) +r(u 5 ,u 6 ) +r(u 7 ,u$), and 

$ A 2 ' 34 ' 56 ' 78 : W^) -> l {ai = a2 , a3 =a 4 ,a 5 =a 8 ,a 7 =a 8 }e- Ari2 ' 34 ' 56 ' 78 ), (5-30) 

respectively. Because of (* A 2 ) 4 = q,f^ 56 >™ an d ($ A 2 ) 4 = §i 2 - 34 > 56 - 78 both 36's basis 
elements play important roles. 

The matrix representation of the generator fl 

Here, we give the matrix representation of the generator fl in terms of the basis * A , i.e. 
we find matrices A and A such that 

ml = ml = a$ a . (5.3D 

The following list contains the action on the first 36 basis vectors from }&_ x and }&_ x . 
In 5. as an example (see below) we are dealing with the function 

* A 2 ' 34 ( and $ A 2 ' 34 ), for which we use as abbreviation. (5.32) 

As this symbol indicates, in , 3> A 2 ' 34 , two non-overlapping pairs are sampled. 
We have already seen in d5.14b that 

^12,34 = _ ( - 2A + 6 )^12,34 + 2 ^12 + 4^12,23^ (g 33) 

^$12,34 = _ ( - 2A + 4l? + 6 )$12,34 + 2 $12 + 4 §12,23 (5 34) 

Our list only contains the last two terms (which are egual in both equations) which arise 
from r~\ r~\ by merging of a pair of leaves, i.e. merging for example the marked 
leaves in to t^~Y~*\ or merging the marked leaves in to . The 

coefficient of * A 2 ' 34 in fM A 2 ' 34 (and of $ A 2 ' 34 in ft$ A 2 ' 34 ) is given such that the sum of 

12 34 12 34 

coefficients in f7* A ' equals A times the number of pairs (i.e. 2) in \P A ' (and $ times 
the number of different indices (i.e. 4)). Since the table would be the same for the $ A 's, 
we restrict ourselves to the & x 's. 

0. * A = • -> — 

1. * A 2 = -> lx • 

2. * 12 ' 12 = A -> lx • 
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3. $rf' 23 = r^T^ -^2xf^\ lx^S 



5. *i 2 > 12 ' 12 = n _> ix 



6. 



^12,12,23^ ^i x r^ 1X^ lxA 



7. *f 13 ' 23 = ^3x^ 



8. ^ 2 - 12 < 34 = /^ ->i x r^ lxFS 4x/^V^ 



g ^12,23,24 _ ^-yS^ -^3xr^\ 3X/^V^ 



io. ^\ 2 - 23 - 34 = r^ry^\ -^3xr^r^ 2xf = \~\ ix^y^ 



i2. ^12.34.56 = ^ n\ ^3 X r^ i2 X r~v^ 



13. *«W2,ia sr ^ _> lx . 



i4. *f 12 ' 12 > 23 =fV> ^ixr^ lxfi ix 



15. ^2,12,23,23=^^ ^2xA 1> 

16. #12,12,13,23 = ->lx/^ 2> 

17. ^12,12,23,34 _ ^=y~Y^ ^lxr>^ 2x/ ;J V^ lx lx / ;= ¥ : ^ b 

18. ^12,23,23,34 _ ^y^-^ lx r>^\ 2x 2x A^ lx/^V^ 

19. = ^3x^V^ lx^^ 2x/^V^ 

20. *f 23 ' 34 ' 14 = ^vS^ ^4x^^ 2x/^¥^ 

21. ^12,12,23,24 ^^^ > lx^Y^ 2xA^ 2x lx 

22. VL712,12,34,34 ^ ^ ^^ 2 xA 4x fYS 

23. vL7l2,12,12,34^^ ^ ^ Ix lxA 4x 

24. ^12,23,34,45 ^^^^^^ ^ 4x r>^Y^ 2x /^^Y^Y^ lx rV = V^ 



2x^^S ?= >^ lx/^rS^ 



25. ^12,23,34,25 = ^-y^S^ -> 2x r^S^\ 2x r^r^r\ lxA^ 

2xrYV^ 2x^ < Y^\^ lx/^^VT^ 
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26. # 



12,23,24,25 
A 



28. # 



12,12,23.45 



2j ^12,12,34,45 _ ^ 



-> ixr>^\ 2x 



Axf^TY^ 2xf = YS^\ 



lxA A 

^V^ r\ -^ixr^ ix/^n ixAn 2xf\^y^ 



29. - 23 ' 13 ' 45 = ^Sr^ -^ix/^y^\ 3x/^n ex/^T^^ 



30. * 



31. * 



12,23,45,56 



12,12,34,56 



A 




4xr^r^\ 4x, 

Ar^r^ -> i x r^ 2x A 4x A 
8x/^V^ 



32. * 



12,23,24,56 
A 




lx/^A ^ 3X| 

3x/^V^ 



33. * 



12,23,34,56 
A 



r~\ -> ix^~y~y~\ 3xry^\ 4x, 



34. ^12,23,45,67 = ^ ^ ^ 2x ^ 2x ^ ^ ^ 




ixP\nn 4x, 



35. ^12,34,56,78 ^^ ^ ^ ^ ^ 4x ^ ^ 24x , 



For the matrices A (and A) representing il in terms of the basis * A (and J> A ), we give 

here only the first 13 (5) rows and columns for A (A), which are dealing with samples 
of at most three (two) pairs (i.e. \1\ < 3 (< 2) in H5.26I Q. For A, we get 










V 







-3 
-3 



3A + 3 








3A + 3 


-1 










3A + 6 



-1 











3A + 6 


-2 













3A + 10 
-12 










3A + 15 / 



while for A we have 
/ 



-A 



-1 A 
-1 





V o 





2d- 

2 






2A + 20 
-1 




2A 







30- 








2A + 40 + 6 J 
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Note that both matrices are diagonalisable. The reason is that the submatrix contain- 
ing the same eigenvalue (take 3A + 6 in A above as an example) is diagonal. Hence, 
there are three independent eigenvectors for this eigenvalue and so, we find a basis of 
eigenvectors. 

In the rest of this section, the calculations for the ^ x 's and ^'s are completely 
analogous using the T A 's instead of the Y A 's. Hence, we only present the calculations 
concerning ^'s. 



for s < t. We do this using the following procedure. We 



Conditioning as a linear map 

During our calculations, we need to be able to compute terms like 
E 

have given the following objects: 

1. A basis 

T A :=(T A ,T^T A 2 ' 12 ,...) (5.35) 

such that fiY A = AfcT A , i.e. Y A is an eigenvector of the generator Q for the eigen- 
value A fe (where k e {0; 12; 12, 12; 12, 23; 12, 34; . . . }. As argued below, these eigen- 
vectors exist, since A is diagonalisable. 

2. A diagonal matrix D t - S with diagonal entries e - A fc(*- s ) in the fcth line. Since the 
Afe's are given by 1., this matrix is readily obtained. 



3. An invertible matrix M^ x = (M^)" 1 , such that £ A A/-j- A = T A and T X AI^ X = * 



These two matrices allow a change of basis from J> A to T A and back. 

Since T A is the basis of eigenvectors of the matrix A, the matrices AI^ X and AI^ X 
are obtained by standard linear algebra. We stress that both matrices are lower 
triangular since A is lower triangular. 



(5.36) 



Then, because (e Afc *T A (t)) t > are martingales, (compare with Lemma l5~8l l, 

E[£j(X t )|J- s ] = Mf* ■ E[Tx(Xt)\T s ] T = A/f A ■ D t _ s ■ T X (X S ) T 
= M§ x ■ D t _ s • Aff * • £ A (X S ) T . 

This means that conditioning linear combinations of elements in \£ A (X t ) on F s can be 
represented by a composition of linear maps. 



Multiplication with * A 2 as a linear map 

We need to compute the action of multiplying an element of Va with ~$>\ 2 . Since Y\ is 
actually an algebra of functions (see Remark [23] and Remark 2.8 in BGPW12II ), this can 
be done. We have that 

*A 2 *A = *f 34 , * A 2 ' 12 ' 34 , * 12 - 23 > 45 , . . . ). (5.37) 

This means that there is a linear matrix Q such that 

*a 2 *I = Q*L ( 5 - 38 ) 
Apparently, Q is the matrix for a linear map on V A , with respect to the basis }£_ x . 
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Again, we only give the calculations for the ^'s in detail. We use 

((A + l)* 42 - l) 4 = (A + 1)4^34,56,78 _ 4(A + ^3^12,34,56 

+ 6(A + l) 2 * 42 ' 34 + 4(A + - 



(5.39) 



By construction, we obtain for all k £ 2 

E[*S(Xoo)] = E[(T A (X 00 )Mg) fe ] = (Mf *) 0;fc (5.40) 

since E[T*(Xoo)] = for k ^= as in H5.18D . Hence, (for the first two terms, see 
also (5?l2l ) 

(A + l)^^) = 1, 

(X + 1)W 2 - 34 (X ) 4A 4 + 26A 3 +49A 2 + 36A + 9 1 5 /J_N 

(A + l) * A {Xoo ) - 4A4 - 24A3 - 4?A2 - - - - 1 + - - — 2 + O j , 

n , 1 ^12,34,56^ v 36A 7 + 618A 6 + 4143A 5 + (3(A 4 ) 
^ 1 °° j 36A 7 + 564A 6 + 3487A 5 + 0(A 4 ) 
3 95 / 1 \ 
= 1+ 2A-18A7 + °fe)' 
,^-.^,12.34,56,78^ , 36864A 16 + 1536000A 15 + 28807680A 14 + 0(A 13 ) 



36864A 16 + 1425408A 15 + 24729088A 14 + C(A 13 ) 



i 3 193 <n/ 1 

Plugging the last expressions in (15-396 gives the result. 



(5.41) 



5.6 Proof of Lemma 15. 51 

Now we can compute the fourth moments of the increments of t n- (A + l)^ 12 (X t ). 
Again, we only give the proof of the first assertion. The second follows by replacing the 
^> A 's by the ^'s. Using Minkowski's inequality and the fact that we start in equilibrium 
we have 



(5.42) 



(E[(*f(X t ) - *i2(X )) 4 ]) 1/4 < (E[(* 42 (X t )) 4 ]) V4 + (E[(* 42 (X )) 4 ]) 1/4 

= 2(E[*i a ' 3 *' B6 ' 78 (X )]) 1/4 

and therefore, since E[* A 2 ' 34 ' 56J8 (A:o)] = 0(A~ 4 ) as A ^ oo by l5"¥TV we have 

(A + lfEK^iXt) - * A 2 (X )) 4 ] < 2 4 (A + 1) 4 E[* 42 ' 34 ' 56 ' 78 (X )] - 0(1). (5.43) 

Thus, the expression on the left hand side of 1 15.81 ) is bounded. 
Now, using 

(a - b) 4 = a i -b i - 46(a 3 - b 3 ) + 6b 2 (a 2 - b 2 ) - 46 3 (a - b) (5.44) 
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we obtain, 



E[(^(X t )-^(Xo)) 4 ] 

= E [*f -34,56,78^) _ ^12,34,56,78^ 

12,34,56/ v \ , T ,12,34,56/ 



- 4E[*?(X )(V? ' 34 ' 56 (X t ) - ' 34 ' 56 (X ))] 

+ 6E[*f' 34 (X )(*f' 34 (X t )-vI/f' 34 (X ))] 

_ 4E[*f < 34 < 56 (X )(*A 2 (* t ) - *i 2 (X ))] (5 - 45) 

-4 / E[*f(X )Q*f ,34>56 (X s )]ds + 6 / E[*jf' 34 (X )O*f' 34 (X s )]ds 
Jo Jo 



4 / E[*f 34 ' 56 (X )^i 2 (X s )]ds, 
Jo 



because *^' 34 ' 5fa ' 78 (X t ) = ^ x 2 ' 34 ' 5bJH (X a ), since we start in equilibrium. 

Let us consider the expectation in the second term on the right hand side. We have 

^l 2 ' M (x )n^ 3i (x s )] = E[^ 3 \x )-E[m\ 2 ' M (x s )\To]] 

= E[ttf 34 (X )(A ■ mx(X s )\T }) 12M ] 
= E[*i 2 ' 34 (X )(A ■ M^E[T A (X s )|7- ]) 
= E[*i 2 ' 34 (X )(A • M^D S M^* X (X )) 1 
= E[V?(X )(A ■ M^D S M^Q^ X (X )) 

= e[(a • M^z? s A/^gg* A (x )) 12 ]. 



12,34J 
2,34] 
12,34] 

12,34J" 



(5.46) 



The expression in the last line can easily be integrated in s over [0, t], because only 
D s depends on s. In addition, X is in equilibrium and hence, the expectation can be 
evaluated using the equilibrium distribution of X. All three terms in the right hand side 
of I I5.45D can be computed analogously. Using Mathematica, we see that d5 . 8D holds. 

6 Proof of Lemma 13.41 Propositions 13.31 and 13.61 and Theorems [3] 
and [4] 



6.1 Proof of Proposition [373] 

By Lemma [3~4l we have to show that I I3.7D holds. Recall <3> A 2 from H4.14D . We abbrevi- 
ate ^\ 2 := ty\ 2 (X 00 ) and compute, using Lemma l5~4l 



A 2 E[((A + l)vI/i 2 -l) 4 ] ^>^. (6.1) 
Therefore, there is C > 0, such that 

mx+w-i\>.)* m+iy F- in s£*. (6.2) 

c c A 

It follows that almost surely |(n + l) 1 ^} 2 — 1\ > e for at most finitely many n by the 
Borel-Cantelli lemma. Thus, 

1 = lim [\]^} 2 x i < Uminf(A + 1)^\ 2 < limsup(A + l)^ 2 < lim [A -Hi*, 2 =1 (6.3) 

A— ¥00 ' ' A— foo A— >-oo A— >oo ^ ^ 

holds almost surely and the result is shown. 
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Recall that 1 13.71 ) is the short form of 

(A + l) / fi® 2 (d( Ul ,U2))e- Xr ^>^ 1. (6.4) 



Furthermore, since Fi(2e) is the probability of sampling a leaf from Bi(2e) under the 
sampling measure (and hence, Fi(2e) 2 is the probability of picking two leaves from 
Bi(2e)), for every realisation of N e ,Fi and /i^ we have 

iX>(2e) 2 = Yei ^ 2 (rf(«i,«2))l { , (ui , U2 )< 2E} = ^[0,2e]. (6.5) 

i— 1 

Here we set v := r(-, ■)*f-L < ^, i.e. the distribution of the distance of two randomly chosen 
leaves. Thus, the assertion of Lemma T3.4I is the eguivalence of A J e~ Xx v(dx) A ^°°> 1 
and \v[0, e] £ ^°> 1. This eguivalence is implied by classical Tauberian Theorems, as e.g. 
given in Theorem 3, Chapter XIII. 5 of !IFel66ll . 



6.3 Proof of Proposition [3761 

We proceed in three steps. 

• Step 1: Tightness of one-dimensional distributions of Z x . 

• Step 2: Moments and covariance structure. 

• Step 3: Tightness of Z x in path-space (in the space of continuous functions). 
Step 1: Tightness of one-dimensional distributions of Z x . We obtain from Remark [5.9[ 



E[(Z x f] = XE[(\t + l)H% 3 4 - 2(\t + + 1] 1 



which shows the claimed tightness. 

Step 2: Moments and covariance structure. For the covariance structure, we have to 

12^12 
sA t\ 



compute moments of ^Yi^Yl which are not included in the manuscript up to here. We 



set 

^{ 2 f? (([/,r, fl)) := ( Ai ®N ) g-Ar(t. I ,« a )-A / r( Ua ,«,)^ 



A + l 



T 12,23 ._ ^12,23 _ 1^12 _ 1 ,t,12 _ 1 ,T,12 

1 x,v — w a,A' 2 +A ' A + ? A' + 2 

AA' + 4A + 4A' + 12 
+ 2(A + 2)(A' + 2)(A + A' + 3) : 

T 12,34 ._ ^12,34 _ 4-12,23 , .£^12 , 1 ,j,12 , 1 ,t,12 

T A ,V A, A' 3 *A,A' + 15*A+A' + 3( A + 5 )*A +3 (A / + 5 )*A' 

4AA' + 25A + 25A' + 150 



(6.7) 



15(A + 5)(A' + 5)(A + A' + 
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Then 

Wff = -(A + A' + 3)^3 + $ 12 + ^12 + ^12 +vj 
Q *A 2 V 4 = -( A + A ' + 6 )*A 2 A^ 4 + 4 *A 2 A 23 + *A 2 + *A'i 

mr A 2 = -(A + l)T A 2 , (6.8) 
^ t a 2 a 23 = -(A + A' + 3)T A 2 A f, 

0T I 2 a 34 = -( A + A ' + 6 ) t a 2 v 4 - 
Now, in order to compute the covariance structure and higher moments, we write, again 
using Mathematica, 

E[ZX A ] = AE[(sA + l)(iA + 1)^,34 _ x] 

4stA 3 

~ ([s + t)X + l)((s + t)X + 3)((s + t)X + 6) (6-9) 
> 



{s + t) 3 ' 
For the third moment, 

E[(7 A V 3 1 = A 3 / 2 16t 3 A 3 (5* 2 A 2 + 9^-10) ^oo 

KtjJ (tA + 2)(tA + 3)(tA + 5)(2tA + l)(2tA + 3)(3tA + l)(3tA + 10) 

(6.10) 

The fourth moment was already given in 06.11 1. 

Step 3: Tightness of Z x in path-space. Here, we show that there exists C > 0, which is 
independent of A, such that 

sup A ■ E[((tA + l)*^^) - (sA + I^IKX^)) 2 } < C(t - s) 2 (6.11) 

A>0 

for < s < t. Tightness then follows from Step 1 and the Kolmogorov-Chentsov crite- 
rion. In order to show J6.1H , we simplify the notation and suppress the dependency on 
Xoo. From 06.7D , we read off E^ 2 ' 3 *]. Then, the result follows from 

((iA + l^f-^A + l)^ 2 ) 2 

= (tX + lf*%f x - 2(sX + l)(tX + l)^ 2 ; 3 ! + (sX + 1) 2 *H% 

with an application of Mathematica and d6. 1 1 D is shown. 

6.4 Proof of Theorem [3] 

As in the proof of Theorem [1] we only need to show the assertion in the case a = 
due to absolute continuity recalled in Proposition 12. Ill In this case, the proof of the 
theorem reguires the following three steps: 

• Step 1: Instead of starting in X , it suffices to start in eguilibrium and then 
show (13. 23ft . 

• Step 2: Assume that X Q is in equilibrium, i.e. X = X^. Then, the set of processes 

{((A + l)* A 2 (X t )-l) t >o : A>0} (6.13) 

is tight in Cr([0, 00)). 

• Step 3: The finite-dimensional distributions of {((A + l)4' A 2 (X t ) — l)t>o : A > 0} 
converge to as A — > 00. 
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These steps imply that the object in ( 13.23b converges to in distribution, which is (in 
the case of convergence to a constant) eguivalent to convergence in probability. 



Step 1: Start in equilibrium. Let X = (X t )t>o with X t = (U t ,r t ,jit) be the tree-valued 
Fleming-Viot process started in eguilibrium. Then, X and X can be coupled such that 
for e > 

(Mf N ,0) = (Mf N » (6-14) 

for all 4> : rW — > R which depends only on elements in r with values at most t. 

Recall the Moran model approximation. Observe that distances evolve with time 
of rate 2t and may change by resampling events by being reset to zero. The coupling 
arises in this model for every TV by taking the same resampling events for X and X 
which defines the two processes on a common probability space. For this coupling, 

sup (A + m\ 2 (X t ) - ^(X t )\ = sup (A + l)|(/if N - Mf N , l { ,(« 1 ,« 2 )>*} e " Ar( " 1 ' tl2) )l 

e<t<r £<t<T 

< (jjf N , Ae- Ae ) + (/If N , Ae~ AE ) = 2Ae" Ae 0. 

(6.15) 

Taking now on this common probability space the limit N — s- oo results in the coupled 
laws. 

Hence, it suffices to prove the assertion when started in eguilibrium, i.e. 1. holds 
for all assertions which concern properties which depend only on distances below a 
threshold and in particular all limiting properties close to the leaves. 

Step 2: Tightness of {((A + l)^\ 2 (X t ) - l) t > : A > 0}. This is clearly implied by 
Lemma [B31 and the Kolmogorov-Chentsov criterion for tightness in Cr([0, oo)). 

Step 3: Convergence of finite-dimensional distributions to 0. This follows from 
Lemma [5. 11 



6.5 Proof of Theorem [4] 

We proceed in several steps. 

• Step 0: Warm up; computation of first two moments of W\(t) — W\(s). 

• Step 1: The family (W\)x>o is tight in Cr([0, oo)). 

• Step 2: If (W(t)) t >o is realization of a limit point, then (W(t)) t >o and (W(t) 2 — t) t >o 
are both martingales. 



Throughout we let (J"t)t>o be the canonical filtration of the process (X t ) 



t>o- 



Step 0: Computation of first two moments of W\(t) — W\{s). We start with some basic 
computations which we will need in the seguel. First, recall that (A + 1)T^ 2 = (A + 
1)\&^ 2 - 1 and by Lemma |5~8l 

E[(A + l)Tf(X t )\F s ] = e- A (>- s > • (A + l)Tf(X s ). (6.16) 

Then, by Fubini's theorem 

E[Wx(t)-W x (s)\F s } = \J E[(A + l)Tf(X r )|J- s ]dr 

= A f e ~ A (— ) -(A + l)Ti 2 (X s )dr (6 - 17) 

J S 

= (l- e -X*-«)).(A + l)Ti a (X,). 
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We compute 

((A + 1)T A 2 f = (A + 1) 2 * 12 < 34 - 2(A + 1)* A 2 + 1 

2 ~± 
5 1a 



(A + 1)^(T A ^ + -T A 



2(A + 6) , 
(A + 2)(A + 5) 



4A 2 + 18A + 9 



(A + l)(A + 3)(2A + l)(2A + 3) 



-2(A + l)Tf -1 



/\ i ^2/^12,34 . 4 12.23 . 2 12.12 
i A + l ) \ X + 3 l \ + 5 L X 



8(A + 1) , 
(A + 2)(A + 5) 
2A 2 



(A + 3)(2A + l)(2A + 3)' 



which already implies that 



E[Wx(t)-W x ( 3 )\F a ] 



A— J- 00 



> in i 2 , 



since we started in equilibrium. Hence E[T A (AT )] = as in Lemma ( I5.18D . 
Next, we come to the second moment 

E[(VK A (i)-VK A ( S )) 2 |J- s ] = A 2 -E[(^ t (A + l) 2 Ti 2 (r) dr)*^ 

= 2\ 2 - [ f E[(A + l)T A 2 (X ri )E[(A + l)T A 2 (X r2 )|J-,, 1 ]|j- s ] ( ir2dri 
J s J n 

= 2A 2 - f /' t E[(A + l) 2 T A 2 (X ri )e-( A + 1 )^-'- 1 )Ti 2 (X ri )|j- s ]dr2dri 

J s J 7*1 



A 



s Jri 
2 /-t 



A + 1 7 S 

i - s + A x 



(1 _ g-CA+iJCt-nJj . E[((A + i)T A 2 (X ri )) 2 |J- s ] dn 



with 



A A = A x ,i + A x ,2 + A x ,3, 

4 - 4A 4 



A X ,2 



(A + l)(A + 3)(2A + l)(2A + 3) 



2A 2 (A + 1) 



2A + 6 



_ e -(2A + 6)( i - S ) )T 12,34 (Xs) 



4(2A+| (1 „ e -(2, +3 )( t - S ) )T 12,23 (Xs) 

2(2A + 6) 



5(2A + 2) 



(1 _ e -(2A+2)(t- S ) )T 12,12 (Xs) 



8(2A + 6) 



Ax,21 + Ax, 22 + Ax,23 



Ax,3 

where 

A X ,21 : 

^A,22 : 



— (1 - e" (A+1)( *" s), )T^ 2 fX ^ 
A 2 (A + 2)(A + 5) 1 6 ) l \^>>) 

2A 2 A + 1 / e-^+D^O .E[((A + l)T A 2 (X ri )) 2 |J- s ]dn, 

J S 

A 2 (A + l)^i2, 3 4 (Xs) _ 4A 2 + 18A + 9 



A + 3 V^ A y ^ s/ (A + l)(A + 3)(2A+l)(2A + 3)y 
2(A + l)(A 3 + 6A 2 + 8A + 24), 



(A + 3)(2 + A)(5 + A) 



(6.18) 
(6.19) 



(6.20) 



(6.21) 
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a _ ( 2 \+6)(t-s) 2A2 ( A + !) Y 12,34/ Y x _ 8A 2 (A + 1) , 2X+m _ s) 12,23/ y s 

^a, 23 - 2A + 6 1 sj 3(2A + 3) A 1 sj 

_ (2A+2)(t-s) 4A2 ( A + 1) T 12.12 ( y v 2(A + 1) fA + i)f t -,) T ia f y x 

5(2A + 2) " A (Asj+ (A + 2)(A + 5) 6 \ s /° 

Since 

A A ,i^>0, 

^,21,^,22^0 inL 2 , (6.22) 

-4a,23,-4a,3 >0 inL 1 , 

we have 

E[(^ A (t)-^ A ( S )) 2 |J- s ] ^^>t- s inL 1 . (6.23) 



Step 1: The family (Wa)a>o is tight in Cr([0, oo)). Again, we use the Kolmogorov- 
Chentsov criterion. To this end we bound the fourth moments of the increments in 
Wy. 



n(w x (t)-w x (s)) 4 } 

= A 4 -E[(Ti 2 (X t )-T A 2 (X 
<A 4 (E[(T A 2 (X t )-T A 2 (X ) 



(A + l)?l 2 (X s )ds ^ 2 (X t ) + ^(X ))' 



(X + l)Ti A (X s )ds 



(6.24) 



The second term is bounded by Ct 2 for some C > which is independent of A by 
Lemma 15.51 For the first term, we use the Burkholder-Davis-Gundy inequality and 
write (recall Lemma T5.8I and the quadratic variation of the semimartingale ^ 12 (X) = 
(^l 2 {X t )) t > from Remark l2~T4l : 



A 4 -E 



T 12 (X t )-Tl 2 (X )+ / (A + l)T A 2 (X s )d s 



<A 4 E[[* A 2 (*)] t 2 ] 

= A 4 / f S -E[(^ 23 (X r ) - V 12 ' 34 (X r )) ■ E[^ 23 (X S ) - V 12 ' 34 (X s )\T r ]} drds. 
Jo Jo 

(6.25) 

We have to show that the integrand is of order 1/A 4 , if X is in equilibrium. 

First, we compute the conditional expectation using Lemma f5.8| b using J5 . 2 2D and 
obtain 



E[S> 12 - 23 (X S ) - *^(X s )\F r ] 



12,34/ 



E 



1 yl2,12 
10 



a (x s )-lr 12 - 23 (x s )-r 12 < 34 (x s )- 



+ 



(A + 2)(A + 5) 
A 2 



(A + l)(A + 3)(2A + l)(2A + 3) 



(2A+l)(s-r) J_X A 2 ' 12 (X r ) - e~ (2A+3)(s-r) iT A 2,23 (X r ) - e -(2A+6)(s-r)Y 12 . 34 (^ r ) 



10 A 

-(A+l)(s-r) 



(A + 2)(A + 5) 



T 12 (X r 



X 2 



(A + l)(A + 3)(2A + l)(2A + 3)' 



(6.26) 



41 



Abbreviating := T^X^), note that by d5 . 1 9D and the last display, the integrand 
in d6T25T> is a linear combination of terms of the form E[(*^ 2 ' 23 - *^ 2,34 )(*^ - E[*£])] for 
k G {12; 12, 12; 12, 23; 12, 34}. We compute all these terms: 



E 



(*r 3 -*r 4 )(*i 

12,23,45 



A + l 



E[* J 



12,34,56i 



1 



E[ ^12,23 _ $ 12,34 n 



A + 1~ L ~ A A 

4A 3 (5A 2 + 9A - 10) 



(A + 1) 2 (A + 3)(A + 5)(4A 2 + 8A + 3)(9A 3 + 51A 2 

o- 1 



76A + 20) 



E 



A 4 , 

12.23 



12,34 
A 

12,12,34,45 



1 



2A + 1 

12,12,34,56 



E[* 

2A + 1 1 

2A 3 (1576A 6 + C(A 5 )) 



12,23 _ ^12,34j 



3(A + 3)(2A + 1) 2 (4A + 15)(2A 2 + 5A + 3) 2 (96A 5 + d(A 4 )) 



E 



l )(*A 



2.23 



5A + 3 



E 



A / V A 
12.23,45.56 _ ^12,23,45,67 



(A + l)(2A + l)(2A + 3) 

(A + l)(A + 3)(2A + l)(2A + 3) L 
2A 2 (683976A 9 + C(A 8 )) 



9(A + 3)(2A + 1) 2 (2A 2 + 5A + 3) 2 (4A 2 + 41A + 105)(1152A 7 + C(A 6 )) 



E 



(*A - *A 



) * 



12,34 
A 



4A 2 + 18A + 9 



E ^12,23,45,67 



(A + l)(A + 3)(2A+l)(2A + 3) 
4A 2 + 18A + 9 



-E[* 



12,34,56,78i _ 

A 1 (A + 1)(A + 3)(2A + 1)(2A + 3)"" L 

4A 2 (20480A n +C(A 10 )) 



(A + 3) 2 (A + 7)(2A + 1) 2 (2A 2 + 5A + 3) 2 (4608A 9 + C(A 8 )) 



which shows that the integrand on the right hand side of H6.25D is 0(1/A 4 ). 
Hence, we have shown that there is a constant C > such that 

n(Wx(t)-W x (s)) 4 ] <C(t-sf 

and we have shown tightness of {{W\(t)) t >o : A > 0}. 



(6.27) 



(6.28) 



12,34i 



(6.29) 



12,34i 



(6.30) 



(6.31) 



Step 2: If (W t )t>o is a limit point, (W t )t>o as well as (Wf — t) t >o are martingales. Let 
W = (W t )t>o be a weak limit point of {(W\(t)) t >o : A > 0}. The claimed martingale 
properties follow by the same arguments as in Step 3 of the proof of Theorem [2j 
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7 Proof of Theorems [5] and [6] 

7 . 1 Proof of Theorem [5] 

To get a first idea, let us do a little computation. By Fubini's theorem, dominated 
convergence theorem and Lemma [5^81 we get 



E 



lim *l 2 (X t )dt = lim / E[i> 12 (X t )]dt 



A— »-oo 



lim -1- [ T (l-E[m{ 2 (X t )])dt 

A^oo A + 1 J 



(7.1) 



lim 



1 



A->oo A + 1 

thus, since |U t has no atom liniA-s.00 ^ 12 (Xt) = 0, 
P(/j, t has no atom for almost all < * < T) = P^J 



T-E[*^(Jf r )-*r(0)] =0, 



^{limA^oo *i 2 (X t )>0} = ] = 1. 



(7.2) 



Our task is to remove the almost in the P(.) on the left hand side. 



As in the proof of Theorem [3j it suffices to show the assertion if X = X^ . Recall 
that rw{ 2 = 1 - (A+ 1)*^ 2 from Lemma [5~8l Hence, for all A > 0, the process (M x (t)) t >o 
defined as 



(7.3) 



M x {t) := ^{X t ) - *\\X Q ) - I (1 — (A + 1)* A 2 (X S )) ds, 

Jo 

is a continuous martingale with guadratic variation (recall Remark [2.141 1 

[Mx]t= [\^ 23 (X s )-^ 34 (X s ))ds. 



(7.4) 



Using H5.22I I and recalling from Lemma [5T51 that the T's are mean-zero martingales 

lim E[M\(£) 2 ] = lim / E[*^ 2 ' 23 (X S ) - * A 2 M (X S )} ds = 0. (7.5) 

A— >oo A— >oo Jq 



This implies for all e > by Doob's maximal inequality: 

lim P( sup \M x (t)\ > e) = 0. 

A->oo o<t<T 



(7.6) 



Then we can start calculating as follows. For all e > 0, using again Fubini's theorem 
and then 05.23ft , 

lim Pf sup / (1- (\+l)y x 2 (X s ))ds > e) 

A->oo Vo<t<T Jo ' 

lim p( sup ( / (1 - (A + l)^i 2 (X s )) ds) 2 > e 2 ) 

A-s-oo \0<t<T^ Jo ' ) 

(l-(\ + l)V 12 (X s )) 2 ds>e 2 ) 
( ]' (l~(\ + l)*l 2 (X s )) 2 ds>e 2 " 
< lim £ / T E[(1 - (A + l)<S> 12 (X s )f] ds 



< lim PI sup 

A-s-oo V <«rJo 



lim P( 

A— »-oo 



(7.7) 
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Hence, by J7.6D and ( 17.71 1, there is a subsequence A n t 00 with 



sup \M Xn (t)\ -^=>0, (7.8) 

0<t<T 



sup 

0<t<T 



t 

12 



n 



l-(A n + l)^(X s )ds 



n— ^00 ~ 

s> 



almost surely. Recall that we can characterize the existence of atoms by the property 
whether we can draw two points at distance zero or not which we can in turn charac- 
terize by the A — > 00 limit of the sample Laplace transform. Combining d7.3b with 1 17.81 1 
and with I I7.2D (which allows to discard the 'J^ 2 (0)) gives 

POt has no atoms for all < t < T) = P( lim sup (X t ) = 0) 

"^°°0<t<T 

= P ( lim sup (M An (t) + / (1 - (A» + l)*i 2 „ (X s )) da) = o] = 1. 



(7.9) 



7 . 2 Proof of Theorem [6] 

It turns out that the following simple criterion for existence of a mark function is 
useful. 

Lemma 7.1 (Criterion for mark function). 

An mmm-space (U, r, fi) G admits a mark function if there is a sequence e n I with 

lim P(2li = 2l 2 |ra(ili,il 2 ) < e n ) = 1. (7-10) 

n— ^oo 

where (iii,2li), (1X 2 , 2I2) are two independent pairs distributed according to (i. Equiva- 
lently, 

(H® , l{ ni = g 2 } ^-{r(u 1 ,u 2 )<E n }) n-i-oo^ ^ 
(/X® 2 , l{r( Ml ,n 2 )< £ „}> 

We proceed in three steps to prove Theorem [6j 

• Step 1: Proof of Lemma \l .11 

• Step 2: An extension of Theorem [3j 

• Step 3: Combination of Steps 1 and 2 gives Theorem[6] 

Step 1 : Proof of Lemma \7.1\ Since \i is a probability measure on U x A, we can write 
fj,(du, da) = (TTu)*fi(du) ® da) for some probability kernel K from U to A. We have 
to show that K(u, da) only has a single atom for (7T[/)*/t-almost every u. 

We proceed by contradiction and assume that K(u, ■) is not concentrated on a single 
atom for a set U' C U of positive (7r[/)*A*-probability, i.e. ^(U 7 x A) = S > 0. 

Then, for all sequences e„ I 

T (A 4 ® ' -"-{01=02} -"-{«l,«2ef ',r(in,u 2 )<£„}) 1 r/ 1 ,„ - „, 

limsup ' 2 t — ^-=1-5<1. (7.12) 

n->oc \A* ; -"-{mi, u 2 £U' ,r(u 1 u 2 )<e n } / 

Applying this to the pairs (Sti,iXi) and (2l 2 ,iJ-2) gives 
limsup [P(2li = St2|rt/(Ui,ila) < e n ) 

= limsu P P(2li = SfeMUi.lfe) < £n,iXi,iX 2 € t/') • P(Hi,H 2 € C/') 

+ P(2ti=2t 2 |rc;(iii,il 2 ) <e„,iXi t U 2 Vil 2 £ C/') ■ P(ilj G C/'Vil 2 G C/')] 

< (1 -6') -p(iti,ii 2 g u') + p(Ui iu'yd 2 i u 1 ) < 1. 

(7.13) 
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Hence, 1 17.101 1 cannot hold if P(ili,il 2 € U') > 0. This quantity however is by assumption 
bounded below by S 2 > and we have shown Lemma [7TT1 



Step 2: Extension of Theorem® We will show the following for 

*l 2 ((U,r lf i)) := J ^ 2 {du u du 2 ,da u da 2 )t {ai=a2} e- Xr{ - u ^\ (7.14) 
from J2?7l l: For all T < oo and e > 0, 

lim P( sup \(\ + l)$ 12 (X t )-l\ >e) = 0. (7.15) 

\e<t<T J 

In order to see this, it suffices - analogously to the proof of Theorem [3] - to consider 
a = and to start in equilibrium X = X^. First, we assume that /3(-, ■) is non-atomic, 
i.e. d 5 . 2 D from Section [5] holds. In this case, we proceed as in Steps 2 and 3 from 
Section T6.41 These steps rely on Lemmata 1 5 . 1 1 and 15.51 and we can read off from the 
second assertions from these Lemmata that d7. 1 5t holds. Second, consider the gen- 
eral case, i.e. •) is not necessarily non-atomic. It is straight-forward to construct a 
coupling (X t ,X t )t>o such that (X t )t>o and (X t )t>o use the same resampling and muta- 
tion events, where mutant types in (X t )t>o are chosen according to /?(-, ■), but mutant 
types in (X t )t>o are chosen (at the same rate) according to some non-atomic /?(-, ■). If 
X = X , it is clear that mutant types in (X t ) t >o lead to new types in any case and thus, 
the inequality 

$ 12 (X t ) < $ 12 (X t ) < V 12 (X t ) = y 12 (X t ) (7.16) 

holds for all t > 0, almost surely Recall that we have already shown in Theorem [3] 
that I I7.15D holds if ty 12 is replaced by 'J 12 , and since (3(-, •) is non-atomic, it also holds 
for (X t )t>o by our arguments above. Hence, by 1 17. 16b , it also holds for ^ 12 (X t ), i.e. we 
have shown J7.15D . 



Step 3: Combination of Steps 1 and 2 gives Theorem® Fix < 5 < T < oo. It suffices 
to show that X t e U^ ark for all J < t < T. Using Theorem [3] and Step 2 above, take a 
sequence e n I and set A„ := l/e n such that 

P( lim \ n (fif 2 , e~ x " ri2 ) = 1 for all 5 < t < T) = 1, 

P( lim X n {fj,f 2 , l {ai=a , } e- A " ri2 ) = 1 for all 5 < t < T) = 1. 

n— >oo 

We use Lemma TH\\ and the Tauberian result from Lemma [3~4l with its obvious extension 
to "J^ 2 to write: 



P(X t =(U t ,r t ,fH) € VT for all S < t < T) 

> Pf lim ( ^ 2 'l { 9 ai=a2}1{r( " 1, " 2)<E " }) = 1 for all S < t < T 



>P( lim A„(/if 2 , l{ ai=a ,}l{ r ( u !,«,)<£„}) = 1 

and lim A„(^f 2 , l {r(ui , U2 )< £ „ } ) = 1 for all 5 < t < t) {7A8) 
= P( lim X n (fif 2 ,l {ai=a2} e- Xr ^^) = 1 

and lim A„(/if 2 , e - Xr{ui ' U2) ) = 1 for all S < t < T 

= 1 

by ( 17.171 1. This concludes the proof of Theorem [6j 
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